Human hairless gene and protein

ABSTRACT

The novel nucleotide sequence and deduced amino acid sequence of the human Hairless gene and protein, respectively, are disclosed. A Hairless expression construct may be used in transcription assays. Moreover, processes of making and using the aforementioned products in screening assays which affect Hairless-regulated transcription are disclosed. Kits comprising a polynucleotide, polypeptide, specific binding molecule, or combinations thereof are disclosed.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Appln. No.60/080,888, filed Apr. 7, 1998, the entire disclosure of which isincorporated herein by reference.

STATEMENT REGARDING SPONSORED RESEARCH OR DEVELOPMENT

The invention described herein was made in the course of work undergrant number DK46074 from the National Institutes of Health. The U.S.government may retain certain rights in this invention.

BACKGROUND OF THE INVENTION

1. Field of Invention

The present invention pertains to products and processes useful in thefields of development, genetics, and transcription biochemistry. Theinvention is generally useful in the diagnosis and treatment ofdermatologic conditions.

2. Description of the Related Art

Although hair loss is a problem of great interest to dermatologists andthe lay public, basic knowledge of the biology of hair growth andmaintenance has been limited.

In most mammals, hair does not grow continuously but undergoes cycles ofactivity involving periods of growth, rest, and shedding. On the humanscalp, from 100,000 to 350,000 hair fibers or shafts undergometamorphosis in three distinct stages:

(a) the growth phase (anagen) during which the hair root bulb or dermalpapilla (also called the “follicular papilla”) penetrates deep into thedermis with the cells of the bulb dividing rapidly and differentiatingin the process of synthesizing keratin, the substance of the hair shaftitself. In normal humans, this growth phase is thought to last from oneto five years;

(b) the transitional phase (catagen) is marked by the cessation ofmitosis and lasts from two to three weeks; and

(c) the resting phase (telogen) where the hair is retained within thescalp for up to 12 weeks before the emerging new hair developing belowit dislodges the telogen-stage shaft from its follicle.

Experiments with mouse hair follicles showed that the anagen-stage stemcells, stored within the bulge area of the follicle, proliferate duringearly anagen and migrate to the root bulb region prior todifferentiation. See U.S. Pat. No. 5,279,969. The bulge cells can bestimulated to proliferate in response to physical and chemical stimulicausing telogen follicles to commence anagen. In addition, the physicalproximity of the follicular papilla to the stem cell-containing bulgearea plays a role in the onset of the anagen stage. It has beenspeculated that damage to the bulge region results in permanentalopecia, whereas damage to the hair root bulb alone results in alopeciaareata and is temporary.

It is generally accepted that genetic hair loss arises from activationof an inherited sensitivity to circulating androgenic hormones. Suchandrogenic alopecia is the single most common type of recognizablealopecia to affect both men (50%) and women (30%), primarily ofCaucasian origin. Gradual changes in the width and length of the hairshaft are experienced over time and with increasing age, prematurely insome. Terminal hair is gradually converted to short, wispy, colorlessvellus hair. As a consequence, men in their 20's and women in their 30'sand 40's begin to notice their hair becoming finer and shorter. Inaddition, the ratio of growing hairs to hairs in the resting andshedding phases declines from as high as 9:1 to as low as 2:1.

Androgenic alopecia, or male pattern baldness, is largely the result ofheredity, advancing age, and male hormone secretion, specifically thehormone dihydrotestosterone (DHT). At advanced stages, male patternbaldness is characterized by a bald scalp at the crown of the head and ahorseshoe shaped fringe of hair remaining on the sides of the head. Malepattern baldness may be mediated by time-dependent, steroidhormone-regulated gene expression that results in a diminution in thegrowing phase of scalp hair.

Minoxidil, a potent anti-hypertensive medication, has been used withlimited success to treat male pattern baldness by topical application tothe scalp. See U.S. Pat. Nos. 4,139,619 and 4,596,812. A 2% or 5%solution containing alcohol and polyethylene glycol is used. One theoryfor its mode of action is that blood vessels are dilated and theincreased blood supply stimulates nourishment of hair follicles. Manypatients, however, do not achieve a satisfactory result (youngerpatients and patients with less hair loss have better results), thedegree of new hair growth is usually minimal, and the area of the scalpthat is affected is usually limited to the vertex cranii. Minoxidil'seffectiveness for the treatment of androgenic alopecia may be limitedbecause it does not reduce production of the hormones responsible forcausing male pattern baldness.

Therefore, another approach for treating male pattern baldness has beenthe administration of agents which inhibit the conversion oftestosterone to DHT. Testosterone binds specifically to the 5alpha-reductase enzyme which converts testosterone to its activemetabolite DHT. In turn, DHT binds to nuclear receptor proteins and mayregulate the synthesis of specific proteins which lead to male patternbaldness.

An orally administered inhibitor of 5 alpha-reductase currentlyprescribed for the treatment of male pattern baldness is finasteride, asynthetic 4-azasteroid compound. See U.S. Pat. Nos. 4,377,584;4,760,071; 5,547,957; and 5,571,817. Finasteride is more convenientlyadministered than minoxidil and is more effective than minoxidil intreating androgenic alopecia. However, finasteride also has undesirableeffects which include reducing libido, erection, and semen volume inmen; and causing fetal defects in pregnant women.

A genetic approach to developing pharmaceutical candidates is to screensmall molecules for modulation of transcription factor activity whichregulates hair growth and/or maintenance. For example, Tularik hasdescribed high-throughput assays for screening candidate chemical agentswhich modulate transcription mediated by sequence-specific transcriptionfactors. But a human transcription factor essential to the growth and/ormaintenance of hair was not available until the invention described inthe present application.

For all of the above reasons, it was necessary to develop molecularprobes and genetic models for hair loss. In hr/hr (hairless) mutantmice, initial hair growth is normal but, after the first wave ofshedding, hair fails to grow back and complete loss of hair results. Inthis respect, the development of hair loss resembles alopeciauniversalis caused by a rare inherited mutation in humans (see Ahmad etal., Science, 279, 720-724, 1998; Cichon et al., Hum. Mol. Genet., 7,1671-1679 and 1987-1988, 1998). In addition, mutant mice show increasedsensitivity to ultraviolet (UV) radiation and chemical-induced skincarcinogenesis. The human Hairless gene is identified and characterizedherein.

SUMMARY OF THE INVENTION

The invention provides isolated polynucleotides corresponding to thehuman Hairless (HR) gene and their nucleotide sequences, isolatedpolypeptides and amino acid sequences of the Hairless (Hr) protein, andfragments thereof.

These products may be used in processes to detect Hairless gene orprotein expression, to inhibit Hairless gene or protein expression, toisolate Hairless polynucleotide or polypeptide from a crude mixture, toproduce Hairless polypeptide, to identify a binding molecule specificfor Hairless, to isolate the specific binding molecule from a crudemixture, to detect the specific binding molecule in a crude mixture, toidentify an agent which specifically binds Hairless, to isolate thespecific binding agent from a crude mixture, to detect the specificbinding agent in a crude mixture, to transfect a host cell, and toproduce a non-human transgenic animal.

The present invention is also directed to screening methods to identifyagents that affect expression of the human HR gene or transcriptionalactivity of the human Hr protein.

Uses of Hairless polynucleotide, polypeptide, and specific bindingmolecule are further described below. Kits comprising the aforementionedproducts are also provided to practice the described processes; suchkits would further comprise instructions for performing the processesand/or standards to calibrate diagnostic assays and/or other reagents toperform the processes.

BRIEF DESCRIPTION OF THE DRAWING

Panels A-M of FIG. 1 show a comparison of the amino acid sequences ofthe present invention (Hum Hr, SEQ ID NO:2); a human sequence (accessionnumber AFO39196, SEQ ID NO:3) published by Ahmad et al. (Science, 279,720-724, 1998); a human sequence (accession number not available, SEQ IDNO:4) published by Cichon et al. (Hum. Mol. Genet., 7, 1671-1679 anderratum at 1987-1988, 1998); a rat sequence (accession number U71293,SEQ ID NO:5) published by Thompson (J. Neurosci., 16, 7832-7840, 1996);and a mouse sequence (accession number Z32675, SEQ ID NO:6) published byCachon-Gonzalez et al. (Proc. Natl. Acad. Sci. USA, 91, 7717-7721,1994). An asterisk indicates a position at which there is a differenceamong the human sequences.

DESCRIPTION OF PREFERRED EMBODIMENTS

This invention is based on the discovery of a polynucleotide whichencodes a homolog of the mouse hairless gene. Polynucleotides andpolypeptides of the present invention represent molecules which could bedetected diagnostically or targeted therapeutically in vitro, ex vivo,or in vivo. These polynucleotides and their predicted translationproducts are unique as compared to nucleotide and amino acid sequences,respectively, which are known in the prior art.

According to one aspect of invention, a complementary DNA sequence(cDNA) representing an about 5 Kb messenger RNA (mRNA) transcript inhuman cells may be monitored by polynucleotide detection techniques.Nucleotide sequence specific for Hairless can be used as a probe. Suchprobes could be full length covering the entire transcribed message orgene, at least one coding region, or a shorter length fragment which isunique to the Hairless transcript or gene but contains only a portion ofsame. The polynucleotide may be at least 20 bases to 500K bases long(e.g., 20, 30, 50, 100, 250, 500,1000, 2500, 5000, 10 K, 20 K, 40 K,100K, 250K, or 500K bases).

The Sanger and Maxam-Gilbert sequencing reactions produce a collectionof polynucleotide fragments by enzymatic and chemical methods,respectively. These fragments may be separated by electrophoresis as aladder of bands with different mobilities, detected by labeling, andisolated by by collecting the moving zone containing the desiredfragment with a relative mobility predicted from comparison to astandard. In contrast to template-dependent extension and limitedhydrolysis, polynucleotide fragments may be produced by limited orcomplete nuclease digestion. Caruthers' phosphoramidite synthesis mayalso be used to produce short oligonucleotides. Chromatography and massspectroscopy are alternative methods of separation and detection,respectively.

The polypeptide of the invention has an amino acid sequence which may bepredicted from the nucleotide sequence of the aforementionedpolynucleotide. Full length polypeptide or a shorter length polypeptidefragment can be produced with the predicted amino acid sequence. Thelength of the polypeptide may be in the range of 5 residues to 1250residues (e.g., at least 10, 25, 50, 75, 100, 250, 500, 750 or 1000residues). Translation of the aforementioned polynucleotide fragmentsmay be used to produce any desired polypeptide fragment. Edmandegradation produces a series of polypeptide fragments which areseparated during amino acid sequencing. Fragments may also be producedby chemical or proteolytic hydrolysis. Short oligopeptides may bechemically synthesized by Merrifield's method. Polypeptides may beseparated by electrophoresis, velocity sedimentation, or chromatography;they may be detected by labeling, their spectra of a radiated oradsorbed electromagnetic wave, or mass spectroscopy.

Related nucleotide or amino acid sequences are found when there issimilarity or identity of sequence and this may be determined bycomparison of sequence information, nucleotide or amino acid, or throughhybridization between a human Hairless probe and a candidate source(e.g., Southern or Northern blots, genomic or cDNA libraries).Conservative changes, such as Glu/Asp, Val/Ile, Ser/Thr, Arg/Lys andGln/Asn may also be considered in determining sequence similarity.

Typically, a nucleotide sequence may show as little as 80% sequenceidentity, and more preferably at least 90% sequence identity, betweenthe target sequence and the human Hairless polynucleotide excluding anydeletions or additions which may be present, and still be consideredrelated. Nucleotide sequence identity may be at least 95% and, mostpreferably, nucleotide sequence identity is at least 98%. Amino acidsequences are considered to be related with as little as 90% sequenceidentity between the two polypeptides; however, 95% or greater sequenceidentity is preferred and 98% or greater sequence identity is mostpreferred.

Hairless is well conserved between rodents and man. Thus, the use ofcomplex mathematical algorithms is not required because amino acidsequences can be aligned without introducing many gaps. But suchalgorithms are known in the art, and implemented using defaultparameters in commercial software packages provided by DNASTAR, GeneticsComputer Group, Hitachi Genetics Systems, and Oxford Molecular Group(formerly Intelligenetics). See Doolittle, Of URFS and ORFS, UniversityScience Books, 1986; Gribskov and Devereux, Sequence Analysis Primer,Stockton Press, 1991; and references cited therein. Percentage identitybetween a pair of sequences may be calculated by the algorithmimplemented in the BESTFIT computer program (Smith and Waterman, J. Mol.Biol., 147, 195-197, 1981; Pearson, Genomics, 11, 635-650, 1991).Another algorithm that calculates sequence divergence has been adaptedfor rapid database searching and implemented in the BLAST computerprogram (Altschul et al., Nucl. Acids Res., 25, 3389-3402, 1997).

Conservative amino acid substitutions, such as Glu/Asp, Val/Ile,Ser/Thr, Arg/Lys and Gln/Asn, may also be considered when makingcomparisons because the chemical similarity of these pairs of amino acidresidues would be expected to result in functional equivalency. Aminoacid substitutions that are expected to conserve the biological functionof the native Hairless polypeptide would conserve chemical attributes ofthe substituted amino acid residues such as hydrophobicity,hydrophilicity, side-chain charge, or size. Functional equivalency orconservation of biological function may be evaluated by methods forstructural determination and bioassay as disclosed herein. Thus, aminoacid sequences are considered to be related with as little as 90%sequence similarity between the two polypeptides; however, 95% orgreater sequence similarity is preferred and 98% or greater sequencesimilarity is most preferred.

The codons used in the native nucleotide sequences may be adapted fortranslation in a heterologous host by adopting the codon preferences ofthe host. This would accommodate the translational machinery of theheterologous host without a substantial change in the chemical structureof the polypeptide.

While the rat and mouse Hairless sequences may be used as a source formaking variant human polynucleotides and polypeptides, the full lengthrat and mouse Hairless sequences are not within the scope of the presentinvention.

A recombinant clone or expression construct containing a Hairlessnucleotide sequence is a preferred form of the polynucleotide of thepresent invention. The recombinant clone or expression clone may be anepisome, phagemid, plasmid, bacteriophage, cosmid, yeast artificialchromosome (YAC), or bacterial artificial chromosome (BAC). Such cloneor construct could be single- or double-stranded; nucleotides may bedeoxyribonucleosides, ribonucleosides, nucleosides with a modified base,nucleotides with a modified ribose, or combinations thereof; linkagesbetween nucleotides may be comprised of phosphorus, nitrogen, sulfur,oxygen, carbon, or combinations thereof.

The expression construct is further comprised of a regulatory region forgene expression (e.g., promoter, enhancer, silencer, splice donor andacceptor sites, polyadenylation signal, cellular localization sequence)and, optionally, an origin of replication that allows chromosomal orepisomal replication in a selected host cell. The expression constructmay be based on a general-purpose vector with at least on regulatoryregion from a mammalian gene (e.g., actin, hormone responsive element ofglucocorticoid receptor, histone, metallothionein) or a virus (e.g.,adenovirus, baculovirus, cytomegalovirus, herpes virus, Moloney leukemiavirus, mouse mammary tumor virus, Rous sarcoma virus, SV40 virus), aswell as regions suitable for genetic manipulation (e.g., selectablemarker, linker with multiple recognition sites for restrictionendonucleases, promoter for in vitro transcription, primer annealingsites for in vitro replication, recognition site mediating site-specificrecombination). The advantages of such clones or constructs may includeease of genetic manipulation, a source of replicated copies, and theability to shuttle between different host cells or organisms.

Production of such vectors and constructs, like any recombinantmolecule, are known in the art and typically involves enzymes, such asTaq polymerase, DNA and RNA polymerases, DNA and RNA ligases,restriction endonucleases, S1 nuclease, reverse transcriptase, andribonuclease H. See Kornberg and Baker, DNA Replication, Freeman, 1991.The recombinant molecule may be transfected into a host, selectedpositively or negatively, and further manipulated.

Vectors, reagents, and other supplies are commercially available. See,for example, the catalogs and product information of Amersham PharmaciaBiotech, Bio101, Bio-Rad, CLONTECH, Invitrogen, Molecular Probes, NewEngland Biolabs, Novagen, PharMingen, Pierce Chemical, Promega, RocheMolecular Biochemicals, Sigma-Aldrich, Stratagene, and United StatesBiological.

A heterologous promoter may be especially useful to regulate expressionin a host cell or transgenic animal. See No et al., (Proc. Natl. Acad.Sci. USA, 93, 3346-3351, 1996); Rivera et al. (Nat. Med., 2, 1028-1032,1996); Allgood and Eastman (Curr. Opin. Biotechnol., 8, 474-479, 1997);U.S. Pat. Nos. 5,589,362; 5,650,298; and 5,654,168.

The invention also provides primer pairs and other polynucleotides foruse in amplifying polynucleotides (e.g., polymerase chain reaction orPCR, ligation chain reaction or LCR, transcription-mediatedamplification or TMA, other thermal cycling or isothermal reactions) andhybridization probes. A set of such primers may be selected and used forPCR assays to quantitate Hairless transcript abundance within cells.Oligonucleotide sequences may be selected using methods such as thosedescribed in U.S. Pat. Nos. 5,556,749 and 5,639,612, or others known tothe skilled artisan. Therefore, this invention will be useful fordevelopment and utilization of Hairless primers and otherpolynucleotides to quantitate cognate RNA and DNA within cells. Thisinformation may then be used to correlate hair growth/loss with Hairlessexpression or Hairless-regulated transcription. Primers thatspecifically amplify sequences in the vicinity of the Hairless geneticlocus also serve as a sequence tagged site (STS) for 8p12-21.

A host cell may be transfected with an expression construct comprised ofthe polynucleotide of the invention. The host cell may be a human cellline, bacterium, yeast, insect cell, plant cell, rodent cell, cell in aprimary culture, established cell line, somatic cell, or stem cell(e.g., fibroblast, neuron, glial). This invention also provides Hairlesstransgenic non-human animals and mutants (e.g., site-directed mutationsof the human Hairless gene) thereof, and mutants of human somatic cellsby using the polynucleotides of the invention. Preferably, thetransfected cell or transgenic animal will express the human Hairlessgene or a variant thereof. A cell or organism without a Hairless gene(e.g., null mutant, gene knockout) is a preferred host for introductionof the human Hairless gene or a variant thereof because results offunctional assays will not be confounded by endogenous Hairlessactivity.

Based on the Hairless nucleotide sequences, a specific binding molecule(e.g., antisense oligonucleotide or ribozyme) can be used to inhibitHairless gene expression in an organism. Alternatively, specific bindingmolecules developed to neutralize the Hairless protein activity may beproduced in or administered to an organism. Algorithms to guide theselection of hybridization probes, oligonucleotide primers, polypeptidebinding molecules, and antigenic peptides have also been implemented incomputer software packages.

This invention provides isolated polypeptide having biological activityof Hairless and a method for preparation of such polypeptides. Theresidues of the polypeptide may be natural amino acids, designer aminoacids, or non-classical amino acids of either the D or L optical isomer;the residues may be modified by acylation, glycosylation, methylation,phosphorylation, sulfation, or combinations thereof; and linkagesbetween residues may be comprised of phosphorus, nitrogen, sulfur,oxygen, carbon, or combinations thereof. The amino acid sequence ofHairless antigen can be used for preparation of specific bindingmolecules (e.g., polyclonal or monoclonal antibody, antibody fragment,humanized antibody, single chain antibody, phage hybrid protein or othermembers of a combinatorial library) for monitoring protein expression,affinity purification, and functional studies.

Antibody may be produced by immunizing an animal (e.g., chicken, goat,hamster, horse, mouse, rabbit, rat) with Hairless antigen. The immuneresponse may be potentiated by immunoadjuvant, conjugation of antigen toa multivalent carrier, booster immunization, or combinations thereof.Antibody fragments may be prepared by proteolytic cleavage or geneticengineering; humanized antibody and single chain antibody may beprepared by transplanting sequences from the antigen binding regions ofantibodies to framework molecules. Specific binding molecules may alsobe generally produced by screening a combinatorial library for a clonewhich specifically binds Hairless antigen (e.g., phage display library).See U.S. Pat. Nos. 5,403,484; 5,723,286; 5,733,743; 5,747,334; and5,871,974. Antigen may be full length Hairless polypeptide or at leastone fragment thereof.

For immunological screening methods, antibody preparations, eithermonoclonal or polyclonal, may be utilized. Polyclonal antibodies,although generally less specific, typically are more useful in geneisolation. Immunizing an animal may produce polyclonal antibody whichrecognizes multiple epitopes of the Hairless antigen or at least oneimmunodominant epitope. Monoclonal antibody may be produced by fusinglymphocytes of an immunized animal with a myeloma or other immortalizedcell, and selecting clones producing antibody that recognizes a desiredHairless epitope or possesses a desired properties (e.g., activating orneutralizing an activity of Hairless polypeptide, precipitating Hairlesspolypeptide or a fragment thereof). The epitope bound by the antibodymay be present in the native Hairless polypeptide or only in a denaturedform of Hairless.

A molecule able to specifically hybridize to a polynucleotide of theinvention (e.g., a complementary polynucleotide) is also considered aspecific binding molecule. Hybridization conditions are preferablychosen with a stringency that uniquely identifies the human Hairlessgene in a population of genomic DNA or the human Hairless transcript ina population of cellular RNA. Conditions may also be chosen todistinguish human Hairless nucleotide sequences from those of otherspecies by a physical criterion like sedimentation velocity orelectrophoretic mobility. Alternatively, hybridization conditions may berelaxed to identify orthologs or paralogs in human or other mammalianspecies. Specific hybridization by such a molecule is also useful formonitoring gene expression, genetic profiling and fingerprinting, andfunctional studies.

The specific binding molecule of the invention may be a chemicalmimetic; for example, an aptamer or peptidomimetic. It is preferably ashort oligomer selected for binding affinity and bioavailability (e.g.,passage across the plasma and nuclear membranes, resistance tohydrolysis of oligomeric linkages, adsorbance into cellular tissue, andresistance to metabolic breakdown). The chemical mimetic may bechemically synthesized with at least one non-natural analog of anucleoside or amino acid (e.g., modified base or ribose, designer ornon-classical amino acid, D or L optical isomer). Modification may alsotake the form of acylation, glycosylation, methylation, phosphorylation,sulfation, or combinations thereof. Oligomeric linkages may bephosphodiester or peptide bonds; linkages comprised of a phosphorus,nitrogen, sulfur, oxygen, or carbon atom (e.g., phosphorothionate,disulfide, lactam, or lactone bond); or combinations thereof. Thechemical mimetic may have significant secondary structure (e.g., aribozyme) or be constrained (e.g., a cyclic peptide). Solid-phasesynthesis is preferred to avoid representational bias and to generatechemical diversity in making a library of non-natural mimetics. See, forexample, U.S. Pat. Nos. 5,650,489 and 5,877,030. Cleavage from the solidsupport would produce a solution library or selectively release/retainthe mimetic.

For detection, the antibody is labeled using radioactivity or any one ofa variety of second antibody/enzyme conjugate systems that arecommercially available (e.g., alkaline phosphatase, β-galactosidase,horseradish peroxidase). Chemical staining may be used to detectpolynucleotide (e.g., acridine orange, ethidium bromide) or polypeptide(e.g., amido black, coomassie brilliant blue) of the invention.Typically, polynucleotide, polypeptides, and specific binding moleculeare labeled for use as probes in assays of the invention. Preferably theprobe is labeled with a small molecule (e.g., biotin, chromochrome,colloidal gold, digoxygenin, dinitrophenol, fluorochrome, radioisotope,spin label), although enzymes (e.g., alkaline phosphatase,β-galactosidase, peroxidase) or other methods (e.g., agglutination,chemiluminescence, electron spin resonance, energy transfer,flocculation, nuclear magnetic resonance, spectroscopy, surface plasmonresonance) may be used to detect the probe.

Biological functions or activities of Hairless include, but are notlimited to, transcription, growth and maintenance of hair, resistance toUV radiation and chemical-induced skin carcinogenesis, neuraldevelopment, neurological or behavioral characteristics, and othereffects of thyroid hormone mediated through Hairless.

Modulation of gene expression may be effected by affectingtranscriptional initiation, transcript stability, translation of thetranscript into protein product, protein stability, or combinationsthereof. Quantitative effects can be measured by conventional techniquessuch as in vitro transcription, in vitro translation, Northernhybridization, polynucleotide hybridization, reversetranscription-polymerase chain reaction (RT-PCR), run-on transcription,solution hybridization, nuclease protection, Southern hybridization,cell surface protein labeling, metabolic protein labeling, antibodybinding, enzyme linked immunosorbent assay (ELISA), immunofluorescence,immunoprecipitation (IP), fluorescence activated cell analysis (FACS),radioimmunoassay (RIA), and western blotting.

Gene expression is conveniently assayed by use of a reporter orselectable marker gene whose protein product is easily assayed. Suchreporter genes include alkaline phosphatase, β-galactosidase (LacZ),chloramphenicol acetyltransferase (CAT), β-glucoronidase (GUS), greenfluorescent protein (GFP), β-lactamase, luciferase (LUC), or derivativesthereof. Such reporter genes would use cognate substrates that arepreferably assayed by a chromogen, fluorescent, or luminescent signal.Alternatively, the assayed product may be tagged with a heterologouspolypeptide epitope (e.g., FLAG, MYC, SV40 T antigen,glutathione-S-transferase or GST, hexahistidine, maltose binding proteinor MBP) for which cognate antibodies or affinity resins are commerciallyavailable. Examples of drugs for which selectable marker genes exist areampicillin, hygromycin, kanamycin/neomycin, puromycin, and tetracycline.A metabolic enzyme (e.g., dihydrofolate reductase, thymidine kinase) maybe used as a selectable marker in sensitive host cells or auxotrophs.

It is a particular object of the invention to provide processes forscreening candidate chemical agents for the ability to modulateexpression of the HR gene and/or activity of the Hr protein. It isanother object of the invention to provide reporter constructs andexpression systems for screening candidate chemical agents. Yet anotherobject of the invention to provide processes for identifying candidatechemical agents to regulate the growth and/or maintenance of hair.

In such embodiments of the invention, a method is provided for screeningcandidate chemical agents for the ability to modulate hair developmentand/or cell differentiation by activating HR-regulated gene expressionor by inhibiting HR-regulated gene expression. Moreover, a method isprovided for screening candidate chemical agents for use in modulatingmaintenance and/or growth of hair. Furthermore, a method of screeningcandidate chemical agents which modulate the binding of Hr to thyroidhormone receptor is provided to regulate hairless transcriptionalactivity. A high-throughput screening assay is preferred.

A screening method may comprise administering a candidate chemical agentto an organism, or incubating a candidate chemical agent to a cell ortissues, and directly assaying for modulation of HR gene activity or Hrprotein activity. Modulation may be an increase or decrease in activity.HR gene or Hr protein activity may be increased at the level of rate oftranscript initiation, rate of transcript elongation, stability oftranscript, translation of transcript, rate of translation initiation,rate of translation elongation, stability of protein, rate of proteinfolding, proportion of protein in active conformation, functionalefficiency of protein (e.g., binding constant for DNA, activation orrepression of transcription), or combinations thereof. See, for example,U.S. Pat. Nos. 5,071,773 and 5,262,300.

A screening method may comprise incubating a candidate chemical agentwith a cell containing a reporter construct, the reporter constructcomprising an Hr-responsive transcription regulatory region covalentlylinked in a cis configuration to a downstream gene encoding an assayableproduct; and measuring production of the assayable product. A candidatechemical agent which increases production of the assayable product wouldbe identified as an agent which activates gene expression from theHR-responsive region, and a candidate chemical agent which decreasesproduction of the assayable product would be identified as an agentwhich inhibits gene expression from the HR-responsive region. See, forexample, U.S. Pat. Nos. 5,849,493 and 5,863,733.

A screening method may comprise measuring in vitro transcription from areporter construct incubated with Hr protein (or transcriptionallyactive fragment thereof) in the presence or absence of a candidatechemical agent, the reporter construct comprising a transcriptionregulatory region which is responsive to Hr protein (ortranscriptionally active fragment thereof); and determining whethertranscription is altered by the presence of the candidate chemicalagent. In vitro transcription is preferably assayed using a cell-freeextract (more preferably, a nuclear extract); partially purifiedfractions of the cell-free extract; purified transcription factors orRNA polymerase; or combinations thereof. See U.S. Pat. Nos. 5,453,362;5,534,410; 5,563,036; 5,637,686; 5,708,158; and 5,710,025.

A screening method may comprise incubating Hr protein (or fragmentthereof) with a candidate chemical agent and thyroid hormone receptor(or fragment thereof); and determining the amount of the thyroid hormonereceptor (or fragment thereof) which is bound to Hr protein (or fragmentthereof), the desired chemical agent being one which increases ordecreases binding. Preferably, at least one of the Hr protein (orfragment thereof) and the thyroid hormone receptor (or fragment thereof)is immobilized to a solid substrate to facilitate separation of boundfrom unbound complexes.

Methods for measuring transcriptional or translational activity in vivocan be any which are known. For example, a nuclear run-on assay may beemployed to measure transcription of a reporter gene. The translation ofthe reporter gene may be measured by determining the activity of thetranslation product of the reporter gene. Methods for measuring theactivity of an assayable product of certain reporter genes are wellknown.

In a preferred embodiment, the above methods are assayed in vitro withor without hairless transcription activity. This may be accomplished byusing cell or proteins from sources without hairless transcriptionactivity such as mouse hr/hr mutant cells or animals, gene knockoutsomatic cells or animals, or cells which do not normally expresshairless. In another preferred embodiment, the transcription is assayedwith or without thyroid hormone, thyroid hormone receptor, or atdifferent stages of development or cell differentiation.

Candidate chemical agents can also be screened for use in regulating thegrowth and/or maintenance of hair by their ability to regulate theactivity of Hr protein. The ability of a candidate chemical agent toregulate the transcriptional activity of Hr protein may be assessed bymeasuring transcription from an Hr-responsive regulatory region.

A transcription reaction comprises a regulatory region responsive to Hrprotein and a reporter gene. The reporter gene operably linked to theregulatory region in a reporter construct could be any gene known in theart. In a preferred embodiment, the length of the promoter region to beassayed is less than 200 bp and no more than 1000 bp. The regulatoryregion in the reporter construct can be any polynucleotide to which Hrprotein binds by itself or in a complex (e.g., with thyroid hormonereceptor). The regulatory region is responsive to Hr protein whichregulates transcription of the reporter gene downstream from andadjacent to the regulatory region. One possible example of suchregulatory regions comprises the upstream sequence of the Hairless gene,especially sequences which are identified by a consensus thyroid hormonereceptor binding site (see SEQ ID NOS:7-10). Other regulatory regionsmay be identified and isolated by selection according tosequence-specific binding of Hr protein. See U.S. Pat. Nos. 5,747,253;5,869,241; and 5,888,738.

Suitable methods for measuring in vitro transcription are known. Invitro transcription may be carried out by incubating a reporterconstruct, labeled nucleotides (e.g., ³²P-ATP), transcriptionally activecell-free extract, nucleotides, and buffer reagents in the presence andabsence of a candidate chemical agent. The procedures for producingcell-free extracts and partially purified fractions are well-describedin the art; the conditions for in vitro transcription are also wellknown. The labeled transcript can be separated by slab or capillary gelelectrophoresis, detected by autoradiography, and quantitated by anytechnique known in the art. Optionally, in vitro transcription can becarried out in the presence of Hr protein and/or thyroid hormonereceptor.

A candidate chemical agent which increases production of an assayableproduct in the cell indicates the potential to increase expression ofthe HR gene or a downstream target gene of the Hr protein. A candidatechemical agent which increases the level of in vitro transcriptionindicates its ability to enhance the activity of the transcriptionregulatory Hr protein. Candidate chemical agents which increaseexpression of the HR gene or its downstream target gene can potentiatethe growth and/or maintenance of hair. These agents can potentially beadministered to a human.

A candidate chemical agent which decreases production of assayableproduct in the cell indicates the potential for the agent to decreaseexpression of the HR gene or its downstream target gene. A candidatechemical agent which decreases the level of in vitro transcriptionindicates its ability to reduce the activity of the transcriptionregulatory Hr protein. Candidate chemical agents which decreaseexpression of the HR gene or its downstream target gene can inhibit thegrowth and/or maintenance of hair. These agents can potentially beadministered to a human.

According to another embodiment of the invention, candidate chemicalagents regulating the binding between Hr protein and thyroid hormonereceptor may be identified. Hr protein can be attached to an insolublepolymeric support such as acrylamide, agarose, cellulose, or plastics,or other supports such as glass. A candidate chemical agent is incubatedwith the immobilized Hr protein in the presence of thyroid hormonereceptor. Alternatively, thyroid hormone receptor can be immobilized ona solid support and a candidate chemical agent can be incubated with theimmobilized thyroid hormone receptor in the presence of Hr protein.After incubation, non-binding components can be washed away, leavingthyroid hormone receptor bound to Hr protein/solid support or Hr proteinbound to thyroid hormone receptor/solid support, respectively. Washingmay be facilitated by forming the solid support into a bilious strip, awell of a 96-well plate, a bead, a chromatography column, or a porousmembrane. Solution transfer may be accomplished by fluid channels,magnetic particles, or robotics.

The amount of HR protein or thyroid hormone receptor can be quantifiedby any means known in the art. For example, it can be determined using abinding assay detected by autoradiography, enzyme colorimetry,excitation energy transfer, fluorescence polarization, fluorescencequenching, liquid scintillation, or surface plasmon resonance. Theamount of bound Hr protein or thyroid hormone receptor may be comparedwith and without the candidate chemical agent. A desirable agent is onewhich increases or decreases the binding of Hr protein to thyroidhormone receptor.

Although the binding of Hr protein and thyroid hormone receptor isdescribed above, it should be understood that binding between Hr proteinand another transcription factor (i.e., hetero-oligomers) or formationof Hr homo-oligomers may be assayed in a similar manner. A complex of Hrprotein and ROR receptor may be formed, or Hr protein may form a complexwith another orphan receptor. Such complexes may be formed and assayedin the presence or absence of ligand, with or without cognate nucleotiderecognition sequence, or combinations thereof. Hr protein may not bind anucleotide recognition sequence. However, sequence specificity may bechanged or conferred by joining Hr protein to a heterologous DNA-bindingdomain (DBD) of known sequence specificity.

Bound complex may be visualized by X-ray crystallography or nuclearmagnetic resonance spectroscopy to identify contact points betweensubunits of the oligomer. Small molecule mimetics can be designed toincrease or decrease formation of oligomers. See U.S. Pat. Nos.5,790,421 and 5,835,382.

The identification of other genes and proteins whose expression oractivity is Hr-dependent will provide additional targets for drugdevelopment. Gene expression profiles may be compared prior to and afterinduction of HR transcription or Hr transcriptional activity.Transcription of Hr-dependent genes may be activated by addition ofthyroid hormone (comparing with and without Hr activity), or byintroducing the HR gene under the control of an inducible promoter intoa host cell that lacks endogenous HR transcription.

Hr-dependent genes may be identified by techniques detectingdifferential expression such as a subtractive cDNA library screened withpost-induction transcripts minus pre-induction transcripts, or bydifferential screening of cDNA or genomic clone libraries. Differentialmessage display (U.S. Pat. Nos. 5,459,037; 5,599,672; 5,665,544;5,707,807; 5,807,680; 5,814,445; 5,851,805; and 5,876,932); subtractivehybridization (U.S. Pat. Nos. 5,316,925; 5,643,761; 5,804,382;5,830,662; 5,837,468; 5,846,721; and 5,853,991); computer-assistedcomparison with an electronic database (e.g., U.S. Pat. No. 5,840,484);differential screening of arrayed cDNA clones or libraries (e.g., U.S.Pat. Nos. 4,981,783; 5,206,152; and 5,624,801); reciprocal subtractiondifferential display (RSDD; U.S. Pat. No. 5,882,874); and serialanalysis of gene expression (SAGE; U.S. Pat. No. 5,866,330) may be usedto identify Hr-dependent genes.

The Hr-dependent gene transcripts will be translated into Hr-dependentproteins, such proteins may be identified by comparing the pattern ofproteins expressed prior to and after induction of Hr (with or withoutthyroid hormone). For example, pre- and post-induction cultures of thehost cells may be ³⁵S-pulsed, protein extracts may be made from wholecell lysates or subcellular fractions, and Hr-dependent proteins will beidentified by their increased or decreased signal intensity intwo-dimensional gels of ³⁵S-labeled proteins from pre- andpost-induction cultures. Proteins of interest (i.e., labeled proteinswhich increase or decrease in abundance) may be isolated, N-terminal orinternal peptide amino acid sequence may be determined, and theHr-dependent genes of interest identified by cloning with degeneratepolynucleotides whose sequences are predicted according to thedetermined amino acid sequence.

Hr-dependent genes may also be identified by promoter trapping. Hr maybe induced in cells after introducing the HR gene under the control ofan inducible promoter into a host cell that lacks endogenous HR geneexpression or Hr activity. A clone library of gene fragments insertedinto a promoter probe vector can be constructed to operably link thegene fragment with a reporter gene, such that a promoter contained inthe gene fragment may direct the transcription of the indicator gene. Asuitable indicator gene will be transcribed and produce a detectableindicator product under appropriate assay conditions. Individual clonesof the library may be introduced into the host cell, and coloniesreplica plated under conditions of hairless induction or non-induction.Gene fragments will be isolated from colonies which produce indicatorproduct only when hairless activity is induced because they couldcontain Hr-dependent promoters. Alternatively, a construct containingthe indicator gene but no operably linked promoter may be randomlyintegrated into the chromosome of a cell. Clones which containintegrations near Hr-dependent promoters may be identified afterinduction of hairless activity by screening for the indicator product.Those integration sites could mark the sites of Hr-dependent promotersand isolating the Hr-dependent genes associated with such promoters mayalso identify Hr-dependent genes.

Differentially expressed genes may be isolated and cloned throughdifferential message display, RNA fingerprinting, representationaldifference analysis (RDA), subtractive hybridization, substractionbetween electronic databases, differential screening of arrayed cDNAclones or libraries, reciprocal subtraction differential display, serialanalysis of gene expression, and generation of expressed sequence tags(ESTs). See Soares (Curr. Opin. Biotechnol., 8, 542-546, 1997) andreferences cited therein. Hairless-regulated genes whose expression iscorrelated temporally with at least one of the anagen/catagen/ telogenstages of the cycle, or spatially in a balding region of the scalp areespecially desired.

Transcriptional and/or translational fusions of Hairless and aheterologous polynucleotide or polypeptide, respectively, are consideredto be encompassed by the invention. In a transcriptional fusion, anon-translated region of the heterologous gene may be ligated to theHairless gene or, alternatively, a non-translated region of the Hairlessgene may be ligated to the heterologous gene. The reading frames ofHairless polypeptide and a heterologous polypeptide may be joined in atranslational fusion. If a reporter or selectable marker is used as theheterologous polynucleotide/polypeptide, then the effect of mutating thenucleotide/amino acid sequences of Hairless or heterologouspolynucleotide/polypeptide on Hairless function may be readily assayed.In particular, a transcriptional fusion may be used to localize aregulated promoter of the Hairless gene and a translational fusion maybe used to localize Hairless protein in the cell. For polypeptidefusions, a peptide recognition site for a protease (e.g., enterokinase,Factor Xa, thrombin) may also be included to separate peptide domainsand to isolate them from each other.

For example, a chimera with domains from Hairless and a heterologousprotein may be produced. The heterologous protein may be a transcriptionfactor such as those described in Locker, Transcription Factors:Essential Data, Wiley, 1996. The domain may include a sequence motifsuch as a helix-turn-helix, a zinc finger, a leucine zipper, orcombinations thereof. A function such as sequence-specific binding,activation or silencing of transcription, oligomerization, orcombinations thereof may be associated with the domain. The domain maybe derived from another mammalian transcription factor (e.g., nuclearhormone receptor with an identified ligand or orphan receptor), aprokaryotic transcription factor (e.g., LexA), or a lower eukaryotictranscription factor (e.g., GAL4).

According to another aspect of the invention, the Hairless DNA istranscribed to produce Hairless RNA transcript, the Hairless RNA istranslated to produce Hairless nascent chain, the Hairless nascent chainfolds to produce Hairless protein in its native conformation, and thenative Hairless protein is processed to produce a modified Hairlessprotein with any native post-translation modifications (e.g., acylation,disulfide linkage, glycosylation, phosphorylation, proteolytic cleavage,sulfation). Nascent chain, native protein, and modified protein areknown generically as polypeptide. In analogy to rodent sequences, humanHairless polypeptide may have a relative mobility of about 127 KDa indenaturing SDS-PAGE.

Hairless polypeptide and its variants (i.e., deletion, domain shuffling,insertion, substitution, and combinations thereof) are useful fordetermining structure-function relationships (e.g., alanine scanning,conservative or non-conservative amino acid substitution). See Wells(Bio/Technology, 13, 647-651, 1995) and U.S. Pat. No. 5,534,617. VariantHairless polypeptides are encoded by suitable variant Hairlesspolynucleotides.

Structure-function relationships of Hairless may be studied usingvariant polypeptides in a transcription assay. Thus, mutations indiscrete domains of the Hairless polypeptide may be associated withhormone receptor binding, activation of transcription, repression oftranscription, or combinations thereof. Binding studies may also be usedto identify and isolate a natural ligand for Hairless. Chemical agentswhich bind Hairless may be useful for modulating (i.e., activating orrepressing) its transcriptional activity.

A human Hairless polynucleotide, polypeptide, or specific bindingmolecule may be used to identify and detect this genetic marker infamily pedigrees (e.g., CEPH/NIH or Utah projects), radiation hybrids,or human-rodent somatic cell hybrids. Fingerprinting would allowidentification of an individual within a genetically similar populationor construction of a genealogy among genetically related individuals.Genetic divergence during the evolution of mammals would predict thatthe human gene and protein would be more similar to other primates, thanto primates. Thus, genetic differences as reflected in the affinity ofspecific binding molecules or sequence comparisons may be used inmolecular taxonomy to determine evolutionary relatedness of differentspecies.

Mutations that are functionally significant and polymorphisms inHairless nucleotide and amino acid sequences are also an aspect of theinvention. Such variants may be mutations or polymorphisms found asnatural variations in the population without phenotypic consequence, ormay be affect biological function by increasing or decreasingtranscriptional activity. Comparison of SEQ ID NOS:2-4 shows severalpossible polymorphisms. Ahmad et al. (Science, 279, 720-724, 1998) andCichon et al. (Hum. Mol. Genet., 7, 1671-1679 and 1987-1988, 1998)describe mutations that may cause congenital alopecia. Mutations may belocated in regulatory and/or coding regions of the gene, they are usefulto establish structure-function relationships in the disclosed aminoacid sequence.

For example, a retroviral insertion which reduces the steady-state levelof mouse Hairless message is responsible for the phenotype. The rhinoallele exhibits no or little message in a hr^(rh)/hr^(rh) homozygote.Such a null (or almost null) mutant, or a cell derived therefrom, mayserve as a host for introduction of an expression construct. A dominantallele Hr^(n) (near naked) has is also being characterized. See,generally Sundberg, Handbook of Mouse Mutations with Skin and HairAbnormalities, CRC Press, 1994.

Genetic polymorphism in the Hairless gene may be used in linkagemapping, genetic fingerprinting, molecular taxonomy, and to study therole of Hairless in quantitative trait linkage (QTL), especiallyalopecia. For example, detection of a restriction fragment lengthpolymorphism (RFLP), random amplified polymorphic DNA (RAPD), amplifiedfragment length polymorphism (AFLP), single-stranded conformationalpolymorphism (SSCP), single nucleotide polymorphism (SNP), short tandemrepeat (STR), variable nucleotide tandem repeat (VNTR), ormicro-satellite length heterogeneity may be linked to a genetic trait orphenotype. Such polymorphisms (or mutations if the polymorphism resultsin a mutant phenotype) may also be useful to investigate gene expressionand development.

Detection of a germline or somatic mutation will determine that adisease is inherited or acquired, respectively. Identification ofmutations by molecular genetic or cytogenetic techniques may alsodetermine how Hairless expression is regulated during development.

The Hairless nucleotide sequence can be used to produce a fusionpolypeptide with at least one heterologous peptide domain (e.g., anaffinity or epitope tag). Polypeptide antigens are useful for producingspecific antibody and epitope mapping of Hairless-specific antibody.Hairless polypeptide may be any length between about 5 amino acidresidues to about 1250 amino acid residues. Preferably, it is producedin soluble form and/or refolded in native conformation. Polypeptide maybe conjugated to either member of a specific binding pair (e.g.,antibody-digoxygenin/hapten/peptide, biotin-avidin/streptavidin,GST-glutathione, MBP-maltose, polyhistidine-nickel, proteinA/G-immunoglobulin).

Polypeptide may be synthesized by chemical means, purified from naturalsources, synthesized in transfected host cells, or combinations thereof.Polypeptide synthesized in transfected bacteria from an expressionconstruct will be non-glycosylated but, if eukaryotic post-translationalmodifications are desired, the expression construct may be transfectedinto a suitable eukaryotic cell (e.g., yeast, insect, hamster, mouse,rat, somatic, stem, non-human zygote) or organism (e.g., insect,non-human mammal, hamster, mouse, rat, plant). Hairless polynucleotideper se or an expression construct comprising the Hairless polynucleotidemay be introduced into the host cell or organism by a process such aschemical transfection (e.g., calcium phosphate, cationic liposome,DEAE-dextran, polybrene), electroporation, genetic immunization,infection by recombinant virus, or microinjection. Preferably theintroduced polynucleotide is an expression construct and the expressionconstruct integrates into the eukaryotic genome of the host cell ororganism. See, for example, Goeddel, Gene Expression Technology,Academic, 1990; Murray, Gene Transfer and Expression Protocols, Humana,1991; Tuan, Recombinant Gene Expression Protocols, Humana, 1997; Tuan,Recombinant Protein Protocols, Humana, 1997.

According to another aspect of invention, a set of oligonucleotides maybe selected from the Hairless nucleotide sequence. This set of primerswill be specific for amplification of Hairless gene and can be used as apair for PCR and RT-PCR amplification of DNA and RNA, respectively; asingle oligonucleotide can be used for specific hybridization to aHairless nucleotide sequence.

The polynucleotide may be ligated to a linker nucleotide sequence orconjugated to one affinity tag of a specific binding pair (e.g.,antibody-peptide epitope/digoxygenin/hapten, biotin-avidin/streptavidin,GST-glutathione, MBP-maltose, polyhistidine-nickel, proteinA/G-immunoglobulin). The polynucleotide may be conjugated to theaffinity tag by ligation of a nucleotide sequence encoding the affinitytag or by direct chemical linkage to a reactive moiety on the affinitytag by crosslinking.

Polynucleotide and/or polypeptide of the invention may be used as anaffinity tag to identify, isolate, and detect interacting proteins thatbind the Hairless gene or protein. Such interacting proteins mayregulate Hairless gene expression (e.g., affinity chromatography ofsequence-specific DNA-binding proteins, electrophoretic mobility shiftassay, footprinting, methylation interference, one-hybrid system) orform protein complexes with regulate the cellular function of Hairless(e.g., crosslinking of protein complexes, screening a phage displaylibrary, two-hybrid system). The invention is not limited to suchprotein agents but may also be used to identify, isolate, and detectother chemical agents which may regulate Hairless gene expression orHairless protein function by screening, for example, a combinatorial ornatural product library for agents which potentiate or inhibit thegrowth and maintenance of hair.

Further, the polynucleotide, polypeptide, and specific binding moleculemay be optionally attached to a solid substrate (e.g., glass orsilanized slide, magnetic bead, microtiter plate, nitrocellulose, nylon,resin bead). Such reagent would allow capture of a molecule in solutionby a specific interaction between the cognate molecules andimmobilization of the solution molecule on the solid substrate. See, forexample, U.S. Pat. Nos. 5,143,854; 5,639,603; 5789,162; and 5,789,172.Monitoring Hairless expression is facilitated by using biochips ormicroarrays. See, for example, U.S. Pat. Nos. 5,445,934; 5,510,270;5,545,531; 5,677,195; and a special supplement (Nat. Genet., 21,1-60,1999).

Nucleotide and amino acid sequences may be synthesized in situ on thesubstrate by solid phase chemistry or photolithography. In situsynthesis attaches the nucleotides or amino acids directly to thesubstrate. Alternatively, the polynucleotide, polypeptide, or specificbinding molecule may be attached by interaction of a specific bindingpair (e.g., antibody-digoxygenin/hapten/peptide,biotin-avidin/streptavidin, GST-glutathione, MBP-maltose,polyhistidine-nickel, protein A/G-immunoglobulin); crosslinking may beused if covalent attachment to the substrate is desired. Glutaraldehydeis a covalent bifunctional crosslinker suitable for immobilization on asubstrate, but a photoactivatable, reversible crosslinker is preferredto identify and isolate molecules interacting in a complex (e.g., athiol linkage that may be reduced).

Hybridization may take place in solution or on a solid substrate. Ifeither the Hairless polynucleotide or probe that undergoes hybridizationis attached to a solid substrate (e.g., glass or silanized slide,magnetic bead, microtiter plate, nitrocellulose, nylon, resin bead),hybridization will result in capture of the unattached species.

An overlapping set of polypeptides which define all possible linearepitopes of Hairless may be arranged on a solid substrate to map theepitope specifically bound by a binding molecule (e.g., polyclonal ormonoclonal antibody). See U.S. Pat. No. 5,194,392. Once a reactiveepitope is defined, it may be used to isolate the specific bindingmolecule or to inhibit binding between Hairless and the specific bindingmolecule. A polypeptide or specific binding molecule thereof may be usedto establish a profiling reference panel, and thereby isolate, detect,or otherwise characterize the chemical agents of the invention. See U.S.Pat. Nos. 5,384,263; 5,541,070; and 5,798,275.

Multiplex analysis may be used to monitor Hairless expression togetherwith expression of another transcription factor (e.g., thyroid hormoneand ROR orphan receptors) or combinations thereof. Such multiplexanalysis may be performed using different polynucleotides orpolypeptides arranged in high density on a solid substrate (i.e., amicroarray). However, simultaneous solution methods such as multi-proberibonuclease protection assay or multi-primer pair polynucleotideamplification associate each transcript with a different length ofdetected product which is resolved by separation according to molecularweight.

Related nucleotide sequences may be defined by a combination ofstructural and functional criteria. For example, related nucleotidesequences derived from the human Hairless sequence may hybridize understringent conditions known in the art. Suitable conditions foroligonucleotides 50 bases or less could be 400 mM NaCl, 40 mM PIPES pH6.4, 1 mM EDTA, 50° C. or 70° C. (see Beltz et al., Meth. Enzymol., 100,266-285, 1983); and suitable conditions for polynucleotides longer than50 bases could be 500 mM NaHPO₄ pH 7.2, 7% mM sodium dodecyl sulfate(SDS), 1% bovine serum albumin (BSA), 1 mM EDTA, 65° C. (Church andGilbert, Proc. Natl. Acad. Sci. USA, 81, 1991-1995, 1984). Short,conserved peptide domains may be used to design amplification primerswhich probe for related nucleotide sequences (Gould et al., Proc. Natl.Acad. Sci. USA, 86, 1934-1938, 1989).

However, these are all rather strict definition because some nucleotidesequences which encode the Hairless polypeptide with 100% identity(i.e., a functional equivalent of the native Hairless polynucleotide)would fail to hybridize under stringent conditions because of theredundancy of the genetic code, but are desirable for use in anexpression construct because of the preferences of host cells andorganisms for certain codons.

An “isolated” polynucleotide or polypeptide is at least partiallyisolated from the source of the polynucleotide or polypeptide. Using thenucleotide and amino acid sequences disclosed herein, compositions orextracts of the invention may be made substantially pure by controlledexpression of the polynucleotide or polypeptide, and isolating same.Expression may be accomplished by extraction from natural sources,recombinant technology, or chemical synthesis.

By “substantially pure”, a composition or extract containing a moleculeis described as being at least 80%, preferably at least 90%, morepreferably at least 95%, and most preferably at least 99% pure by weightas compared to other substances (i.e., contaminants) of the samechemical character as the recited molecule (e.g., nucleotide, aminoacid).

Thus, a “purified” polynucleotide or polypeptide is assessed relative tothe starting source (e.g., cytoplasm, nucleoplasm, cellular or nuclearlysate, cellular or tissue extract) from which purification wasinitiated.

Preferably, such compositions or extracts are reduced by at least 95% ofthe initial number of intact cells and/or viral particles (i.e., 95%free). A substantially cell-free composition or extract is reduced by atleast 99% of the initial number of intact cells and/or viral particles,and a reduction of at least 99.99% may also be achieved. Compositions orextracts may also be cleared so that they are substantially free ofmembranes or membrane-bounded structures (reduced by at least 95% of theinitial membrane content by weight).

Binding is described as “specific” for binding which is able todiscriminate Hairless polynucleotide or polypeptide from a mixture ofother chemical substances which are not related to Hairless. Processesof isolation, detection, and identification may depend on specificbinding of Hairless polynucleotide or polypeptide in the mixture. Theskilled artisan would be able to determine appropriate processconditions to achieve specific binding by choice of length of time,temperature, salt concentration, surfactant, pre-treatment (e.g.,adsorption, affinity purification, subtraction), and post-treatment(e.g., additional rounds of binding, signal amplification, washing).

The meaning of “heterologous” depends on context. For example,heterologous polynucleotide regions or polypeptide domains may mean thatsome regions/domains are not found in the same species in nature (e.g.,a human polynucleotide encoding Hairless and a prokaryotic-derivedpromoter). Another example is that heterologous polynucleotide regionsor polypeptide domains may mean that the regions/domains are not foundjoined together in nature (e.g., a human Hairless polypeptide and a MYCepitope tag, nuclear localization signal, or DNA binding domain).Ligation of polynucleotide regions or fusion of polypeptide domainsoccurs by inventive manipulation, such as by de novo synthesis orrecombination. Of course, such joining may be preceded or followed byfragmentation (e.g., hydrolysis of a phosphodiester or peptide bond)through enzymatic (e.g., nuclease or protease) or chemical methods. In afurther example, transfection of an expression construct into aheterologous host cell or heterologous non-human transgenic animal meansthat the expression construct is not found in the cell's or animal'sgenome in nature.

Similarly, the meaning of “native” depends on context. For a humanpolynucleotide or polypeptide, it may mean that thepolynucleotide/polypeptide was purified from a human source, has asequence identical to a non-mutant human Hairless gene or protein,shares a conformation with properly folded polynucleotide/polypeptide,or is not denatured.

Standard techniques in the art are described in Ausubel et al., CurrentProtocols in Molecular Biology, Wiley, 1998; Birren et al., GenomeAnalysis Series, CSHL, 1997-1999; Coligan et al., Current Protocols inImmunology, Wiley, 1998; Coligan et al., Current Protocols in ProteinScience, Wiley, 1998; Diffenbach and Dveksler, PCR Primer, CSHL, 1995;Dracopoli et al., Current Protocols in Human Genetics, Wiley, 1998;Harlow and Lane, Antibodies and Using Antibodies, CSHL, 1988 and 1999;Hogan et al., Manipulating the Mouse Embryo, CSHL, 1994; Janson andRyder, Protein Purification, Wiley, 1997; Marshak et al., Strategies forProtein Purification and Characterization, CSHL, 1996; Mullis et al.,The Polymerase Chain Reaction, Birkhauser, 1994; Murphy and Carter,Trangenesis Techniques, Humana, 1993; Pinkert, Trangenic AnimalTechnology, Academic, 1994; Sambrook et al., Molecular Cloning, CSHL,1989; Spector et al., Cells, CSHL, 1998.

All publications, applications, and patents cited in this specificationare indicative of the skill in the art, and are incorporated herein byreference in their entirety.

The following examples are meant to be illustrative of the presentinvention, however the practice of the invention is not limited orrestricted in any way by them.

EXAMPLES

Thyroid hormone is a critical mediator of central nervous system (CNS)development, acting through nuclear receptors to modulate the expressionof specific genes. Transcription of the rat hairless (hr) gene is highlyup-regulated by thyroid hormone in the developing CNS; it is shown herethat hr is directly induced by thyroid hormone. By identifying proteinsthat interact with the hr gene product (hr), it was found that hrinteracts directly and specifically with thyroid hormone receptor(TR)-the same protein that regulates its expression. hr associates withTR and not with retinoid or steroid receptors; this specificitydistinguishes hr as the only interacting protein known to date thatbinds to a single nuclear receptor. hr can act as a transcriptionalrepressor, suggesting that its interaction with TR is part of a novelautoregulatory mechanism. Many factors, both genetic and environmental,contribute to the formation and function of the mammalian centralnervous system (CNS). An essential component of these processes isthyroid hormone; if thyroid hormone levels are perturbed, abnormaldevelopment ensues resulting in neurological deficits that includesevere mental retardation. The effects of thyroid hormone (TH) aremediated through the action of specific nuclear receptor proteins.Thyroid hormone receptors (TR) act by binding to specific DNA sequencesand subsequently activating or repressing the transcription of nearbygenes in response to hormone binding. Several proteins that interactwith TR and other nuclear hormone receptors, including bothco-activators and co-repressors, have been identified (Horwitz et al.,Mol. Endocrinol., 10, 1167-1177, 1996; Beato and Sánchez-Pacheco,Endocrine Reviews, 17, 587-609, 1996).

Although much is known about the mechanism of action of thyroid hormoneand other nuclear receptors, far less is known about the genes regulatedby these receptors. The rat hairless (hr) gene has been shown to beup-regulated (>10-fold) by thyroid hormone in developing brain(Thompson, J. Neurosci., 16, 7832-7840, 1996). The rapid induction (<4hours) occurs even in the absence of protein synthesis, suggesting thathr is directly regulated by TR. Direct target genes are particularlyimportant because such genes likely constitute the first step in thegenetic program responsible for TH-mediated aspects of neuraldevelopment. It is shown here that the upstream regulatory region of thehr gene includes a potent thyroid hormone response element (TRE),indicating that hr is indeed a direct target of TR.

The murine hr locus was originally identified as a spontaneous mutationcaused by an endogenous retrovirus (Stoye et al., Cell, 54, 383-391,1988). The hr gene is expressed predominantly in skin and brain; themutant phenotype in skin is progressive hair loss and increasedsusceptibility to cancer, the neurological phenotype has not yet beendescribed (Thompson, J. Neurosci., 16, 7832-7840, 1996; Cachon-Gonzalezet al., Proc. Natl. Acad. Sci. USA, 91, 7717-7721, 1994). The hr geneencodes a putative protein of approximately 130 KD that lacks homologyto known structural motifs other than a cluster of cysteine residuesproposed to form a zinc finger (Cachon-Gonzalez et al., Proc. Natl.Acad. Sci. USA, 91, 7717-7721, 1994).

Towards defining the function of the hr gene product (hr), proteins thatinteract with hr were identified. Surprisingly, it was found that hrinteracts with TR. Previously identified proteins that interact with TRhave been shown to interact with multiple nuclear receptors (Baniahmadet al., Proc. Natl. Acad. Sci. USA 90, 8832-8836, 1993; Oñate et al.,Science, 270, 1354-1357, 1995; Lee et al., Nature, 374, 91-94, 1995;Chen and Evans, Nature, 377, 454-457, 1995; Hörlein et al., Nature, 377,397-404, 1995; Zeiner and Gehring, Proc. Natl. Acad. Sci. USA, 92,11465-11469, 1995; Horwitz et al., Mol. Endocrinol., 10, 1167-1177,1996;Beato and Sánchez-Pacheco, Endocrine Reviews, 17, 587-609, 1996;L'Horset et al., Mol. Cell Biol., 16, 6029-6036, 1996; vom Baur et al.,EMBO J., 15, 110-124, 1996). In contrast, hr interacts only with TR. Theinteraction of hr with TR suggests that hr is part of a novelautoregulatory mechanism by which hr may influence the expression ofdownstream TH-responsive genes.

hr Gene Expression is Directly Regulated by Thyroid Hormone

A rat genomic library (Stratagene) was screened using a 450 bp probefrom the 5′ end of the hr cDNA. Four overlapping clones were isolated. A15 Kb Not I fragment was digested with Ksp I and the resulting 6 and 9Kb fragments subcloned into pBluescript (Stratagene). The subclonedfragments were digested with Alu I and used for gel shift analysis.Proteins were synthesized by coupled in vitro transcription/translation(Promega) in the presence of ³⁵S-methionine (NEN). Synthesis wasanalyzed by running a fraction of the radiolabeled products on a gelfollowed by autoradiography. DNA binding reactions containedapproximately 200 ng of digested DNA mixed with ³⁵S-TR and RXR. Sampleswere run on 5% polyacrylamide gels in 0.5×TBE, fixed, dried and exposedto X-ray film. Fragments that gave shifted bands were restriction mappedand smaller fragments subcloned. The subcloned fragments were digestedwith Alu I and used for DNA binding. This process was repeated until thesmallest binding fragment was determined to be a 106 bp Hinf I-Eag Ifragment. After sequencing the 106 bp fragment (SEQ ID NO:7 includes thenamed restriction sites), overlapping oligonucleotides spanning thefragment were synthesized and used as competitors for DNA binding. Thefunctionally equivalent mouse nucleotide sequence is SEQ ID NO:8. Fortransfection experiments, oligonucleotides were cloned upstream of aminimal thymidine kinase promoter by digesting tk-luc with Hind III,then ligating the annealed, phosphorylated oligonucleotides. Constructswere sequenced to determine number of oligonucleotides present and toconfirm the sequence and orientation.

To determine if hr is a direct target of thyroid hormone receptor (TR),cis-acting sequences were examined to determine whether controlling itsexpression include a binding site for TR and/or TR/retinoid X receptor(RXR) heterodimers. Genomic sequences from the hr gene were digestedwith frequent cutting restriction enzymes and used as probes in a gelretardation assay using ³⁵S-TR. No binding was observed when fragmentswere incubated with ³⁵S-TR alone, but binding was detected when both TRand RXR (unlabeled) were present. A high affinity TR/RXR binding sitewas detected within a 9 Kb Not I-Ksp I fragment immediately upstream ofthe hr transcription unit. By subcloning and testing progressivelysmaller restriction fragments, the TR/RXR binding site was mapped towithin 106 bp located approximately 9 Kb upstream of the first exon. Theisolated, ³²P-labeled 106 bp sequence bound specifically to TR/RXRheterodimers, as binding was competed by TRE-containing oligonucleotides(DR4, synthetic direct repeat TRE; MLV, Moloney Leukemia virus TRE) butnot by a mutated TRE (MHC-M, mutated TRE from α-myosin heavy chain gene)or a retinoic acid response element (RARE, retinoic acid responseelement DR5) (Umesono et al., Cell, 65, 1255-1266, 1991).

To more precisely define the TR/RXR binding site, overlappingoligonucleotides encompassing the 106 bp sequence were synthesized andused as competitors for binding to the 106 bp fragment. Onlyoligonucleotides C and E were effective competitors, and they bothcontained the TR/RXR binding site. These oligonucleotides share a 23 bpsequence that includes an imperfect direct repeat(ggtggAGGGCATCTGAGGACAtc, SEQ ID NO:9) separated by four nucleotides.TREs often consist of half sites spaced by four nucleotides (DR+4), withan optimal half site of AGGTCA. Both half sites of the hr TRE match theoptimal half site in five of six positions. Thus, the hr gene has apotential TRE of the consensus type DR+4. The minimal sequence assayedthat conferred thyroid hormone responsiveness and binds TR/RXR is SEQ IDNO:10. The mouse sequence is identical.

GH1 (rat pituitary) cells were obtained from ATCC and maintained in DMEMsupplemented with 10% fetal calf serum. For induction experiments, serumwas depleted of thyroid and steroid hormones by treatment with AG-1-X8resin (Bio-Rad) and charcoal (Sigma) as described (Samuels et al.,Endocrinology, 105, 80-85, 1979). Cells were grown for one day inhormone depleted media before transfection. Transfection was bylipofection (LIPOFECTAMINE, Gibco-BRL) in 6-well plates. Aftertransfection, thyroid hormone (L-T₃) was added to 10⁻⁷ M. Cells weretransfected (per well of a 6-well plate) with 167 ng of reporterplasmid, 50 ng of expression plasmid and 80 ng of CMX-βgal. Cells wereharvested using 1×reporter lysis buffer (Promega) and assayed forβ-galactosidase and luciferase activity.

To test whether the direct repeat sequence motif indeed confers thyroidhormone responsiveness, the 106 bp fragment and putative TREoligonucleotides were individually placed upstream of a minimalthymidine kinase (tk) promoter driving expression of a luciferasereporter gene. Introduction of these constructs into GH1 cells, whichexpress endogenous thyroid hormone receptors, showed that transcriptionis activated in the presence of thyroid hormone, only when the directrepeat sequence is present (106HE-tkluc, C-tkluc). Therefore, the directrepeat sequence in the hr gene acts as a TRE. Together with previousdata showing that up-regulation of hr by thyroid hormone occurs rapidly(<4 hours) and without the need for new protein synthesis (Thompson, J.Neurosci., 16, 7832-7840, 1996), these results demonstrate that hr is adirect response gene for thyroid hormone, the first such gene identifiedin the mammalian CNS.

hr Interacts with TR

To construct pLexA-hr, a 2.2 Kb Hind III fragment corresponding to aminoacids 575-1215 of hr (FIG. 1) was isolated, the ends filled-in withKlenow large fragment, Bam HI linkers ligated, and then cloned into theBam HI site of pLexA (Hollenberg et al., Mol. Cell Biol., 15, 3813-3822,1995). The resulting plasmid was transformed into yeast strain L40(Hollenberg et al., ibid.). The resulting strain was used to screen ahuman brain cDNA library constructed as a fusion with the activationdomain of VP16. DNA was isolated from HIS+, lacZ+ colonies (Robzyk andKassir, Nucleic Acids Res., 20, 3790, 1992), propagated in E. coli,purified and sequenced. Cells were tested for β-galactosidase activityas described (Reynolds and Lundblad, in Short Protocols in MolecularBiology, Ausubel et al., eds., John Wiley, New York, p. 13-27, 1992). Totest the hormone dependence of interaction, TRIAC (Sigma) was added tothe media and assay buffer (final concentration 10⁻⁶ M).

To begin to understand the function of the hr gene product (hr),proteins were identified that interact with hr by using a two-hybridassay (Hollenberg et al., ibid.). The C terminal 639 amino acids of hr(amino acids 575-1215 as shown in FIG. 1, includes the putative zincfinger) were fused to the lexA DNA binding domain and used as “bait”(LexA-hr) to screen a human brain cDNA library. One cDNA that wasisolated multiple times (2H11) was characterized. Remarkably, clone 2H11encodes a thyroid hormone receptor (TRα2, amino acids 14-490) (Lazar,Endocrine Rev., 14, 184-193, 1993). However, interaction is not limitedto the TRα2 isoform, as TRα1 was found to interact as well. TRα1 is afunctional TR, while TRα2, which has a divergent C terminus, lacks theability to bind thyroid hormone; the preference for isolating TRα2 islikely because the mRNA for TRα2 is more abundant than that for TRα1.Interaction was moderately influenced by hormone, as interaction of TRα1was reduced two-fold by hormone binding. Thus, it appears that theproduct of the hr gene, a direct target of transcriptional regulation byTR, interacts with the same factor that regulates its expression.

Far Western Assay

TrpE-hr was constructed by insertion of a 2.2 Kb Hind III fragmentcorresponding to amino acids 575-1215 of hr into pATH21 (kindly providedby N. Patel). GST-hr was constructed by insertion of the 2.2 Kb Bam HIfragment from pLexA-hr into pGEX3X (kindly provided by J. Shuman).GST-RXR and GST-TR were obtained from Santa Cruz Biotechnology. Extractsfrom bacteria expressing fusion proteins were separated bySDS-polyacrylamide gel electrophoresis and transferred tonitrocellulose. After transfer, filters were prepared for far westernblotting as described (Cavaillès et al., Proc. Natl. Acad. Sci. USA, 91,10009-10013, 1994) except that ³⁵S-labeled proteins were used as probes.pTZ18 (rTRβ1) was kindly provided by H. Towle; pCMX TRα1, pCMX hRARα andpCMX hRXRα were kindly provided by K. Umesono.

To confirm the direct interaction between hr and TR, a far western assaywas used. hr (amino acids 575-1215) was expressed in bacteria as afusion protein with either glutathione S transferase (GST) or TrpE.Extracts from bacteria expressing hr fusion proteins were separated bySDS-polyacrylamide gel electrophoresis and transferred tonitrocellulose: TrpE-Srg1 (negative control), TrpE-hr, GST only, andGST-hr. The immobilized, renatured proteins were incubated with³⁵S-TRα1. TRα1 detected a protein the size of the hr fusion proteins,which was recognized by hr-specific antisera. Therefore, hr interactsspecifically with TRα1. These data also show that no other factors (forexample, other proteins in yeast) are required for this interaction.

Previously identified factors that interact with TR have been shown toalso associate with retinoid receptors, and in some cases, other nuclearreceptors as well. To examine the specificity of interaction between TRand hr, hr was tested for interaction with TRβ1, retinoic acid receptor(RAR) and retinoid X receptor (RXR). TRβ1 bound as well as TRα1 to thehr fusion protein, indicating that interaction is not isoform-specific.In contrast, binding was not detected with RAR or RXR. ³⁵S-RARinteracted with RXR, and ³⁵S-RXR interacted with TR, verifying that RARand RXR are functional in this assay. Consistent with these results, RARdid not interact with hr in the two-hybrid assay. Two steroid hormonereceptors (glucocorticoid and mineralocorticoid) were tested forinteraction using the far western assay, and also failed to interactwith hr. Therefore, of the receptors tested, binding is specific for TR.These results are particularly important because they distinguish hr asthe only protein known to date that binds specifically to a singlenuclear receptor.

Immunohistochemistry and In Vivo Transcription Activity

GH1 and CHO cells were obtained from ATCC and maintained in DMEMsupplemented with 10% fetal calf serum. Transfection was by lipofection(LIPOFECTAMINE, Gibco-BRL) in 6-well plates. Cells were transfected (perwell of a 6-well plate) with 167 ng of reporter plasmid, 50 ng ofexpression plasmid and 80 ng of CMX-βgal. pCMX-GAL-hr was constructed byinserting the Bam HI fragment from pLexA-hr into pCMX-GAL4. pCMX-GAL4,GALpx3 tkluc, pCMX GAL-RXR were kindly provided by K. Umesono. Cellswere harvested using 1×reporter lysis buffer (Promega) and assayed forβ-galactosidase and luciferase activity.

In addition to their interaction in vitro, hr and TR are expressed inthe same cell types in vivo. It was shown previously that hr isexpressed in tissues that express TR; in situ hybridization analyseshave shown that hr and TR transcripts are present in the same cellpopulations in the brain (Murray et al., J. Biol. Chem., 263,12770-12777, 1988; Bradley et al., J. Neurosci., 12, 2288-2302, 1992;Thompson, J. Neurosci., 16, 7832-7840, 1996). For interaction of hr andTR to occur in vivo, both must occupy the same subcellular compartment.TR resides in the nucleus; to determine if hr is also nuclear, sequencesencoding an epitope (MYC) detected by a specific monoclonal antibodywere appended to the hr cDNA (MYC-hr).

An epitope for MYC was appended to the amino terminus of hr at aminoacid 200 by subcloning a 3.1 Kb Bam HI-Xba I fragment of hr into thevector pBS KS+MYC (kindly provided by M. Bellini). The resulting MYC-hrfusion was excised by digestion with Xba I and partial digestion withHind III to isolate a 3.2 Kb fragment, which was inserted downstream ofthe RSV LTR. The resulting cDNA was transfected into GH1 cells, and theMYC epitope detected by immunofluorescence. GH1 cells were grown oncoverslips and transfected by lipofection. Cells were fixed with 1.6%paraformaldehyde for 20 minutes at room temperature, blocked for 30minutes in PBS with 5% normal goat serum and incubated with a mousemonoclonal antibody to MYC (9E10, kindly provided by Z. Wu) for 1 hour.Detection was with cyanine dye CY3 anti-mouse antibody (JacksonImmunoResearch). Cells were mounted in 50% glycerol with 0.25 μg/ml DAPIto counterstain nuclei. Nuclear staining was observed in cellstransfected with MYC-hr but not in control cells. Identical results wereobtained using hr-specific antisera. Thus, like TR, hr is a nuclearprotein.

Identical results were obtained using the hr-specific antisera raised tothe GST-hr fusion protein described above. Antiserum specific to thecarboxyl terminal region of Hr were generated by injecting individualrabbits with acrylamide containing either the trpE-Hr fusion protein (Hramino acids 575-1215) or a GST-Hr fusion protein (Hr amino acids730-1215) using a standard injection schedule. Serum from rabbitsinjected with either fusion protein recognizes a protein band with thepredicted size of Hr (i.e., about 127 KDa) on western blots usingextracts from cells transfected with an expression construct containingthe hr gene and not in extracts from control non-transfected cells.Cells transfected in parallel with RSV-βgal were used as a negativecontrol for the anti-MYC antibody and stained with X-Gal as a positivecontrol for transfection. The pattern of bands detected by the antiserumis more complicated in total protein extracts from tissue sources. Theantiserum detects a band close to the predicted size of Hr but alsoseveral smaller bands. The multiple species detected may representdegradation products of Hr, alternative spliced/processed forms of Hr,related proteins that have a cross-reacting epitope, or combinationsthereof. Further characterization of the specificity and sensitivity ofthis antiserum is in progress.

Since hr is a nuclear protein that interacts with a known transcriptionfactor, hr was tested for a role in transcriptional regulation. Whenco-expressed with TR and a TRE-containing reporter, thyroid hormoneinduced transcription is reduced two fold, suggesting that hr acts as arepressor. To assay for endogenous repressor function, hr (amino acids575-1215) was fused to the GAL4 DNA binding domain (DBD). hr/GAL4plasmids were co-transfected with a GAL4 regulated promoter into CHOcells; all transfections included CMX-βgal as an internal control. Whentested for the ability to repress transcription from a GAL4-dependentreporter, the GAL4 DBD-hr fusion protein reduced basal leveltranscription by about five fold. This effect is specific to hrsequences because the GAL4 DBD alone or a GAL4 DBD-RXR fusion proteindid not affect activity. hr alone did not affect activity, indicatingthat repression by hr required tethering to DNA. The ability of hr torepress transcription indicates that when bound to thyroid hormonereceptors, hr may function as a transcriptional modulator.

Discussion

Together with previous evidence that the hr gene is rapidly up-regulatedby thyroid hormone even in the absence of protein synthesis, the mappingof a high affinity TRE in the hr gene demonstrates that hr is a directtarget of thyroid hormone receptors in the developing mammalian CNS.Although a handful of genes whose expression is influenced by thyroidhormone in the CNS have been identified, induction of these genes hasnot been shown to be rapid (>24 hours) nor resistant to inhibitors ofprotein synthesis. Thus, hr is the first direct response gene forthyroid hormone identified in the developing mammalian nervous system.Given that postnatal CNS development is extremely sensitive to thyroidhormone (if thyroid hormone levels are perturbed, abnormal developmentensues, resulting in neurological deficits that include severe mentalretardation), expression of hr likely constitutes a key step in thegenetic program responsible for TH-dependent aspects of CNS development.

The screen for proteins that interact with hr led to the startlingdiscovery that the product of this thyroid hormone-responsive geneinteracts directly and specifically with TR—the same protein thatinduces its expression. Equally important is the finding that althoughhr binds to TR, it does not bind to RAR or to their common partner, RXR.Though many proteins that interact with nuclear hormone receptors havebeen identified, all have been shown to be widely expressed and to bindto multiple receptors. In contrast, hr is predominantly expressed inbrain and skin, and hr binds a single nuclear receptor, TR. hr and TRare both nuclear proteins, and are co-expressed in various regions ofthe brain, suggesting that the interaction observed in vitro also occursin vivo.

The induction of hr expression by thyroid hormone, coupled with theinteraction of hr protein with TR, suggests a novel autoregulatorypathway. Once induced by thyroid hormone, hr likely binds to TR, andthrough its repression function, modulates expression of downstreamgenes. The existence of a similar autoregulatory mechanism for othernuclear receptors is hinted at by the product of an estrogen-responsivegene (efp) that shows homology to the TIF/PML class of receptorinteracting proteins (Inoue et al., Proc. Natl. Acad. Sci. USA, 90,11117-11121, 1993; LeDouarin et al., EMBO J., 14, 2020-2033, 1995). As adirect target gene, together with its ability to interact with TR, hrlikely serves a dual role—as a downstream target as well as upstreamregulator of thyroid hormone action.

Human Hairless Gene and Protein Sequences

The human homolog of the rat hairless gene was cloned by screening ahuman motor cortex cDNA library (obtained from J. Arriza) using a rat hrcDNA as a probe (Hind III fragment, nucleotides 2088-4299; see accessionnumber U71293 from Thompson, J. Neurosci., 16, 7832-7840, 1996). Thelibrary was screened with hybridization conditions of 50% formamide,5×SSPE, 1×Denhardt's solution, 0.1% SDS, and 100 mg/ml denatured salmonsperm DNA at 42° C. for 16 hours. Four positive clones were detected(37MC1, 37MC3, 37MC6, 37MC12) and the cDNA inserts were obtained inplasmid form. The inserts of 37MC1, 6 and 12 were about 3.5 Kb and theinsert of 37MC3 was about 2 Kb.

The nucleotide sequences from the 5′ and 3′ ends were determined forthese cDNA inserts and found to be homologous to rat and mouse hr cDNAs.The full sequence of 37MC12 was determined by sequencing of bothstrands. The sequence of the longest cDNA clone corresponds to aminoacid positions 232-1215 of the full-length human Hr as shown in FIG. 1.

In Table I, the human HR nucleotide sequence (SEQ ID NO:1) is alignedabove its predicted amino acid sequence (SEQ ID NO:2).

TABLE I TTT TAC TAC AAG GAT CCG AGC ATT CCC AGG TTG GCA AAG GAG CCC 45Phe Tyr Tyr Lys Asp Pro Ser Ile Pro Arg Leu Ala Lys Glu Pro  1               5                  10                  15 TTG GCA GCTGCG GAA CCT GGG TTG TTT GGC TTA AAC TCT GGT GGG 90 Leu Ala Ala Ala GluPro Gly Leu Phe Gly Leu Asn Ser Gly Gly                 20                  25                  30 CAC CTG CAGAGA GCC GGG GAG GCC GAA CGC CCT TCA CTG CAC CAG 135 His Leu Gln Arg AlaGly Glu Ala Glu Arg Pro Ser Leu His Gln                 35                  40                  45 AGG GAT GGAGAG ATG GGA GCT GGC CGG CAG CAG AAT CCT TGC CCG 180 Arg Asp Gly Glu MetGly Ala Gly Arg Gln Gln Asn Pro Cys Pro                 50                  55                  60 CTC TTC CTGGGG CAG CCA GAC ACT GTG CCC TGG ACC TCC TGG CCC 225 Leu Phe Leu Gly GlnPro Asp Thr Val Pro Trp Thr Ser Trp Pro                 65                  70                  75 GCT TGT CCCCCA GGC CTT GTT CAT ACT CTT GGC AAC GTC TGG GCT 270 Ala Cys Pro Pro GiyLeu Val His Thr Leu Gly Asn Val Trp Ala                 80                  85                  90 GGG CCA GGCGAT GGG AAC CTT GGG TAC CAG CTG GGG CCA CCA GCA 315 Gly Pro Gly Asp GlyAsn Leu Gly Tyr Gln Leu Gly Pro Pro Ala                 95                 100                 105 ACA CCA AGGTGC CCC TCT CCT GAG CCG CCT GTC ACC CAG CGG GGC 360 Thr Pro Arg Cys ProSer Pro Glu Pro Pro Val Thr Gln Arg Gly                110                 115                 120 TGC TGT TCATCC TAC CCA CCC ACT AAA GGT GGG GAT CTT GGC CCT 405 Cys Cys Ser Ser TyrPro Pro Thr Lys Gly Gly Asp Leu Gly Pro                125                 130                 135 TGT GGG AAGTGC CAG GAG GGC CTG GAG GGG GGT GCC AGT GGA GCC 450 Cys Gly Lys Cys GlnGlu Gly Leu Glu Gly Gly Ala Ser Gly Ala                140                 145                 150 AGC GAA CCCAGC GAG GAA GTG AAC AAG GCC TCT GGC CCC AGG GCC 495 Ser G1U Pro Ser GluGlu Val Asn Lys Ala Ser Gly Pro Arg Ala                155                 160                 165 TGT CCC CCCAGC CAC CAC ACC AAG CTG AAG AAG ACA TGG CTC ACA 540 Cys Pro Pro Ser HisHis Thr Lys Leu Lys Lys Thr Trp Leu Thr                170                 175                 180 CGG CAC TCGGAG CAG TTT GAA TGT CCA CGC GGC TGC CCT GAG GTC 585 Arg His Ser Glu GlnPhe Glu Cys Pro Arg Gly Cys Pro Glu Val                185                 190                 195 GAG GAG AGGCCG GTT GCT CGG CTC CGG GCC CTC AAA AGG GCA GGC 630 Glu Glu Arg Pro ValAla Arg Leu Arg Ala Leu Lys Arg Ala Gly                200                 205                 210 AGC CCC GAGGTC CAG GGA GCA ATG GGC AGT CCA GCC CCC AAG CGG 675 Ser Pro Glu Val GlnGly Ala Met Gly Ser Pro Ala Pro Lys Arg                215                 220                 225 CCA CCG GACCCT TTC CCA GGC ACT GCA GAA CAG GGG GCT GGG GGT 720 Pro Pro Asp Pro PhePro Gly Thr Ala Glu Gln Gly Ala Gly Gly                230                 235                 240 TGG CAG GAGGTT CGG GAC ACA TCG ATA GGG AAC AAG GAT GTG GAC 765 Trp Gln Glu Val ArgAsp Thr Ser Ile Gly Asn Lys Asp Val Asp                245                 250                 255 TCG GGA CAGCAT GAT GAG CAG AAA GGA CCC CAA GAT GGC CAG GCC 810 Ser Gly Gln His AspGlu Gln Lys Gly Pro Gln Asp Gly Gln Ala                260                 265                 270 AGT CTC CAGGAC CCG GGA CTT CAG GAC ATA CCA TGC CTG GCT CTC 855 Ser Leu Gln Asp ProGly Leu Gln Asp Ile Pro Cys Leu Ala Leu                275                 280                 285 CCT GCA AAACTG GCT CAA TGC CAA AGT TGT GCC CAG GCA GCT GGA 900 Pro Ala Lys Leu AlaGln Cys Gln Ser Cys Ala Gln Ala Ala Gly                290                 295                 300 GAG GGA GGAGGG CAC GCC TGC CAC TCT CAG CAA GTG CGG AGA TCG 945 Glu Gly Gly Gly HisAla Cys His Ser Gln Gln Val Arg Arg Ser                305                 310                 315 CCT CTG GGAGGG GAG CTG CAG CAG GAG GAA GAC ACA GCC ACC AAC 990 Pro Leu Gly Gly GluLeu Gln Gln Glu Glu Asp Thr Ala Thr Asn                320                 325                 330 TCC AGC TCTGAG GAA GGC CCA GGG TCC GGC CCT GAC AGC CGG CTC 1035 Ser Ser Ser Glu GluGly Pro Gly Ser Gly Pro Asp Ser Arg Leu                335                 340                 345 AGC ACA GGCCTC GCC AAG CAC CTG CTC AGT GGT TTG GGG GAC CGA 1080 Ser Thr Gly Leu AlaLys His Leu Leu Ser Gly Leu Gly Asp Arg                350                 355                 360 CTG TGC CGCCTG CTG CGG AGG GAG CGG GAG GCC CTG GCT TGG GCC 1125 Leu Cys Arg Leu LeuArg Arg Glu Arg Glu Ala Leu Ala Trp Ala                365                 370                 375 CAG CGG GAAGGC CAA GGG CCA GCC GTG ACA GGG GAC AGC CCA GGC 1170 Gln Arg Glu Gly GlnGly Pro Ala Val Thr Gly Asp Ser Pro Gly                380                 385                 390 ATT CCA CGCTGC TGC AGC CGT TGC CAC CAT GGA CTC TTC AAC ACC 1215 Ile Pro Arg Cys CysSer Arg Cys His His Gly Leu Phe Asn Thr                395                 400                 405 CAC TGG CGATGT CCC CGC TGC AGC CAC CGG CTG TGT GTG GCC TGT 1260 His Trp Arg Cys ProArg Cys Ser His Arg Leu Cys Val Ala Cys                410                 415                 420 GGT CGT GTGGCA GGC ACT GGG CGG GCC AGG GAG AAA GCA GGC TTT 1305 Gly Arg Val Ala GlyThr Gly Arg Ala Arg Glu Lys Ala Gly Phe                425                 430                 435 CAG GAG CAGTCC GCG GAG GAG TGC ACG CAG GAG GCC GGG CAC GCT 1350 Gln Glu Gln Ser AlaGlu Glu Cys Thr Gln Glu Ala Gly His Ala                440                 445                 450 GCC TGT TCCCTG ATG CTG ACC CAG TTT GTC TCC AGC CAG GCT TTG 1395 Ala Cys Ser Leu MetLeu Thr Gln Phe Val Ser Ser Gln Ala Leu                455                 460                 465 GCA GAG CTGAGC ACT GCA ATG CAC CAG GTC TGG GTC AAG TTT GAT 1440 Ala Glu Leu Ser ThrAla Met His Gln Val Trp Val Lys Phe Asp                470                 475                 480 ATC CGG GGGCAC TGC CCC TGC CAA GCT GAT GCC CGG GTA TGG GCC 1485 Ile Arg Gly His CysPro Cys Gln Ala Asp Ala Arg Val Trp Ala                485                 490                 495 CCC GGG GATGCA GGC CAG CAG AAG GAA TCA ACA CAG AAA ACG CCC 1530 Pro Gly Asp Ala GlyGln Gln Lys Glu Ser Thr Gln Lys Thr Pro                500                 505                 510 CCA ACT CCACAA CCT TCC TGC AAT GGC GAC ACC CAC AGG ACC AAG 1575 Pro Thr Pro Gln ProSer Cys Asn Gly Asp Thr His Arg Thr Lys                515                 520                 525 AGC ATC AAAGAG GAG ACC CCC GAT TCC GCT GAG ACC CCA GCA GAG 1620 Ser Ile Lys Glu GluThr Pro Asp Ser Ala Glu Thr Pro Ala Glu                530                 535                 540 GAC CGT GCTGGC CGA GGG CCC CTG CCT TGT CCT TCT CTC TGC GAA 1665 Asp Arg Ala Gly ArgGly Pro Leu Pro Cys Pro Ser Leu Cys Glu                545                 550                 555 CTG CTG GCTTCT ACC GCG GTC AAA CTC TGC TTG GGC CAT GAG CGA 1710 Leu Leu Ala Ser ThrAla Val Lys Leu Cys Leu Gly His Glu Arg                560                 565                 570 ATA CAC ATGGCC TTC GCC CCC GTC ACT CCG GCC CTG CCC AGT GAT 1755 Ile His Met Ala PheAla Pro Val Thr Pro Ala Leu Pro Ser Asp                575                 580                 585 GAC CGC ATCACC AAC ATC CTG GAC AGC ATT ATC GCA CAG GTG GTG 1800 Asp Arg Ile Thr AsnIle Leu Asp Ser Ile Ile Ala Gln Val Val                590                 595                 600 GAA CGG AAGATC CAG GAG AAA GCC CTG GGG CCG GGG CTT CG AGCT 1845 Glu Arg Lys Ile GlnGlu Lys Ala Leu Gly Pro Gly Leu Arg Ala                605                 610                 615 GGC CCG GGTCTG CGC AAG GGC CTG GGC CTG CCC CTC TCT CCA GTG 1890 Gly Pro Gly Leu ArgLys Gly Leu Gly Leu Pro Leu Ser Pro Val                620                 625                 630 CGG CCC CGGCTG CCT CCC CCA GGG GCT TTG CTG TGG CTG CAG GAG 1935 Arg Pro Arg Leu ProPro Pro Gly Ala Leu Leu Trp Leu Gln Glu                635                 640                 645 CCC CAG CCTTGC CCT CGG CGT GGC TTC CAC CTC TTC CAG GAG CAC 1980 Pro Gln Pro Cys ProArg Arg Gly Phe His Leu Phe Gln Glu His                650                 655                 660 TGG AGG CAGGGC CAG CCT GTG TTG GTG TCA GGG ATC CAA AGG ACA 2025 Trp Arg Gln Gly GlnPro VaL Leu Val Ser Gly Ile Gln Arg Thr                665                 670                 675 TTG CAG GGCAAC CTG TGG GGG ACA GAA GCT CTT GGG GCA CTT GGA 2070 Leu Gln Gly Asn LeuTrp Gly Thr Glu Ala Leu Gly Ala Leu Gly                680                 685                 690 GGC CAG GTGCAG GCG CTG AGC CCC CTC GGA CCT CCC CAG CCC AGC 2115 Gly Gln Val Gln AlaLeu Ser Pro Leu Gly Pro Pro Gln Pro Ser                695                 700                 705 AGC CTG GGCAGC ACA ACA TTC TGG GAG GGC TTC TCC TGG CCT GAG 2160 Ser Leu Gly Ser ThrThr Phe Trp Glu Gly Phe Ser Trp Pro Glu                710                 715                 720 CTT CGC CCAAAG TCA GAC GAG GGC TCT GTC CTC CTG CTG CAC CGA 2205 Leu Arg Pro Lys SerAsp Glu Gly Ser Val Leu Leu Leu His Arg                725                 730                 735 GCT TTG GGGGAT GAG GAC ACC AGC AGG GTG GAG AAC CTA GCT GCC 2250 Ala Leu Gly Asp GluAsp Thr Ser Arg Val Glu Asn Leu Ala Ala                740                 745                 750 AGT CTG CCACTT CCG GAG TAC TGC GCC CTC CAT GGA AAA CTC AAC 2295 Ser Leu Pro Leu ProGlu Tyr Cys Ala Leu His Gly Lys Leu Asn                755                 760                 765 CTG GCT TCCTAC CTC CCA CCG GGC CTT GCC CTG CGT CCA CTG GAG 2340 Leu Ala Ser Tyr LeuPro Pro Gly Leu Ala Leu Arg Pro Leu Glu                770                 775                 780 CCC CAG CTCTGG GCA GCC TAT GGT GTG AGC CCG CAC CGG GGA CAC 2385 Pro Gln Leu Trp AlaAla Tyr Gly Val Ser Pro His Arg Gly His                785                 790                 795 CTG GGG ACCAAG AAC CTC TGT GTG GAG GTG GCC GAC CTG GTC AGC 2430 Leu Gly Thr Lys AsnLeu Cys Val Glu Val Ala Asp Leu Val Ser                800                 805                 810 ATC CTG GTGCAT GCC GAC ACA CCA CTG CCT GCC TGG CAC CGG GCA 2475 Ile Leu Val His AlaAsp Thr Pro Leu Pro Ala Trp His Arg Ala                815                 820                 825 CAG AAA GACTTC CTT TCA GGC CTG GAC GGG GAG GGG CTC TGG TCT 2520 Gln Lys Asp Phe LeuSer Gly Leu Asp Gly Glu Gly Leu Trp Ser                830                 835                 840 CCG GGC AGCCAG GTC AGC ACT GTG TGG CAC GTG TTC CGG GCA CAG 2565 Pro Gly Ser Gln ValSer Thr Val Trp His Val Phe Arg Ala Gln                845                 850                 855 GAC GCC CAGCGC ATC CGC CGC TTT CTC CAG ATG GTG TGC CCG GCC 2610 Asp Ala Gln Arg IleArg Arg Phe Leu Gln Met Val Cys Pro Ala                860                 865                 870 GGG GCA GGCGCC CTG GAG CCT GGC GCC CCA GGC AGC TGC TAC CTG 2655 Gly Ala Gly Ala LeuGlu Pro Gly Ala Pro Gly Ser Cys Tyr Leu                875                 880                 885 GAT GCA GGGCTG CGG CGG CGC CTG CGG GAG GAG TGG GGC GTG AGC 2700 Asp Ala Gly Leu ArgArg Arg Leu Arg Glu Glu Trp Gly Val Ser                890                 895                 900 TGC TGG ACCCTG CTC CAG GCC CCC GGA GAG GCC GTG CTG GTG CCT 2745 Cys Trp Thr Leu LeuGln Ala Pro Gly Glu Ala Val Leu Val Pro                905                 910                 915 GCA GGG GCTCCC CAC CAG GTG CAG GGC CTG GTG AGC ACA GTC AGC 2790 Ala Gly Ala Pro HisGln Val Gln Gly Leu Val Ser Thr Val Ser                920                 925                 930 GTC ACT CAGCAC TTC CTC TCC CCT GAG ACC TCT GCC CTC TCT GCT 2835 Val Thr Gln His PheLeu Ser Pro Glu Thr Ser Ala Leu Ser Ala                935                 940                 945 CAG CTC TGCCAC CAG GGA CCC AGC CTT CCC CCT GAC TGC CAC CTG 2880 Gln Leu Cys His GlnGly Pro Ser Leu Pro Pro Asp Cys His Leu                950                 955                 960 CTT TAT GCCCAG ATG GAC TGG GCT GTG TTC CAA GCA GTG AAG GTG 2925 Leu Tyr Ala Gln MetAsp Trp Ala Val Phe Gln Ala Val Lys Val                965                 970                 975 GCC GTG CGGACA TTA CAG CAG GCC AAA TAG AGG GAT GCT AGG TGT 2970 Ala Val Gly Thr LeuGln Glu Ala Lys                 980 CTG GGA TCG GGG TGG GGA CAG GTA GACCAG GTG CTC AGC CCA GGC 3015 ACA ACT TCA GCA GGG GAT GGC GCT AGG GGA CTTGGG GAT TTC TGG 3060 TCA ACC CCA CAA GCA CCA CTC TGG GCA CAA GCA GGG CACTCT GTT 3105 CCC CTC CCC CTT AAC CCA ACA ACC ACA GTG CCA CCA ACC TCA CAC3150 CTG TCC TTC TCA GGC TGG CAT CTC CCC CAC CCT GTG CCC TTT TAT 3195GTA CAG G 3202

The hairless portion of the fusion protein used in the abovetranscription assays corresponds to rat hr positions 575-1215 (FIGS.1A-1M). Thus, from the conserved portions of amino acid sequence, afunctionally equivalent fusion protein may be constructed with aminoacid positions 575-1215 or 579-1215 of human Hr (FIGS. 1A-1M). Given thesequence conservation between rat and mouse TRE elements, it is likelythat human Hr and thyroid hormone receptor will bind to the rodent TREelements and that a homologous Hr domain will be involved in binding toa nuclear hormone receptor (e.g., thyroid hormone receptor).

Also, the equivalent TRE element from the human genome will likely besimilar to the rodent TRE element in the rodent hairless genes bysequence conservation. Thus, the human or rodent HR genes may beautoregulated by Hr protein or Hr-dependent.

Table II shows pairwise comparisons of the amino acid sequences for thehuman, rat, and mouse polypeptides. The first 26 amino acid residues ofthe rat polypeptide were not considered because it was longer than anyother sequence. The numerator is the number of non-identical aminoacids. The denominator is the total length of the polypeptide (i.e.,1189 amino acid residue) but it is only 984 amino acid residues inpairwise comparisons with SEQ ID NO:2 because of its shorter length. Thepredicted amino acid sequence of human Hr is about 98% identical to SEQID NO:3, but it is greater than 98% identical to SEQ ID NO:4.

The amino acid sequence of human Hr differs from the sequence of Ahmadet al. at 11 amino acid residues: 472 (Trp→Leu), 515 (Leu→Ala), 598(Arg→Gly), 610 (Gly→Ser), 617 (Gly→Glu), 800 (Glu→Asp), 931 (Gly→Ala),968 (Leu→Phe), 1047 (Asp→Arg), 1055 (Arg→Glu), and 1183 (Pro→Ala). Butthere are only differences between the amino acid sequence of human Hrand the sequence of Cichon et al. at three amino acid residues: 363(Asp→Gly), 515 (Leu→Ala), and 617 (Gly→Glu).

These differences and the percent identity calculated for each pairwisecomparison in Table II suggest that the human sequence published byAhmad et al. contains one or more sequencing errors, or polymorphisms.Furthermore, Ahmad et al. report a missense mutation Thr1022Ala in allfamily members with congenital alopecia (a recessive disease), in theheterozygous state in obligate carriers, but absent in unaffected familymembers as well as 142 unrelated, unaffected individuals. The presenceof the mutation identified by DNA sequencing was confirmed byheteroduplex analysis and restriction enzyme digestion. Ahmad et al.concluded that this adenine to guanine transition was not a normalpolymorphic variant. In contrast, Cichon et al. find the mutation to bepresent in a heterozygous state at a frequency of 1.2% in a differentcontrol population (99% confidence level: 0.5% to 2.2%). If Thr1022Alais a disease-causing recessive mutation, one would have expected theprevalence of congenital alopecia to be at least 1/40,000 (i.e., theexpected frequency of homozygotes calculated from an allele frequency of0.5%). Congenital alopecia, however, is a much rarer disease than1/40,000. Therefore, Cichon et al. concluded that “Thr1022Ala is not, inand of itself, a deleterious change.”

TABLE II Rat Mouse Human Ahmad et al. Cichon et al. SEQ ID  100% NO: 5SEQ ID 95.4%  100% NO: 6 (1134/ 1189) SEQ ID 78.5% 80.1%  100% NO: 2(772/ (786/ 984) 984) SEQ ID 80.0% 80.8% 98.9%  100% NO: 3 (951/ (961/ (11/984) 1189) 1189) SEQ ID 80.2% 81.0% 99.7% 99.1% 100% NO: 4 (953/(963/  (3/984)  (11/1189) 1189) 1189)

Northern Blotting

A human multiple tissue Northern blot was purchased from Clontech andprobed with a 3.5 Kb Eco RI-Xho I fragment from 37MC1. An approximately5 Kb mRNA was clearly detected in brain and heart, and an approximately1.4 Kb mRNA was detected in skeletal muscle. Other tissues (placenta,lung, liver, kidney, pancreas) do not show detectable HR mRNA.

Moreover, human brain multiple tissue Northern blots I and II werepurchased from Clontech to determine the regions of the brain whichexpressed HR. An approximately 5 Kb mRNA was detected in amygdala,caudate nucleus, corpus callosum, hippocampus, hypothalamus, substantianigra, subthalamic nucleus, thalamus, cerebral cortex, medulla, spinalcord, occipital pole, frontal lobe, temporal lobe, putamen, andcerebellum. An additional mRNA of approximately 5.5 Kb was detected onlyin cerebellum.

These results demonstrate that the invention may be used as a lineagemarker using at least one of Hairless polynucleotide or polypeptide, orat least one specific binding molecule to the Hairless gene or itsprotein product (e.g., complementary polynucleotide, specific binder ofHr antigen). Specific cell lineages or developmental stages may bedistinguished.

Southern Blotting

Genomic DNA was prepared from human blood, digested with appropriaterestriction enzymes, separated on a 0.8% agarose gel, denatured in 0.5 MNaOH/1.5 M NaCl buffer, neutralized in 0.5 M TRIS (pH 7.4)/3 M NaClbuffer, and transferred to nitrocellulose membrane with 10×SSC buffer. A3.0 Kb Kpn I fragment from 37MC12 was labeled by random priming,hybridized at 42° C. with the hybridization buffer used for libraryscreening, washed at 65° C. with a wash buffer of 0.5×SSC/0.1% SDS, andthen exposed with X-ray film for 20 hours in a −70° C. freezer. Inseparate, single restriction enzyme digestions, hybridizing fragmentsincluded four Bam HI fragments (about 9 Kb, 4 Kb, 2.1 Kb, 1.5 Kb); oneBgI II fragment (about 22 Kb); two Eco RI fragments (about 20 Kb, 7 Kb);one HindIII fragment (about 18 Kb); or one Xba I fragment (about 25 Kb).The foregoing and the chromosome localization results below areconsistent with the human HR gene existing as a single copy gene in thehaploid genome.

These results demonstrate that the invention may be used to distinguishbetween or among individuals in a human population by detection of atleast one genetic polymorphism, to isolate and detect additionalpolymorphisms, to map other human genes with respect to the Hairlessgenetic locus, or to isolate and purify transcription regulatory regionswhich are genetically linked to the Hairless coding region. Specifichybridization and washing conditions are exemplified. A genomic clonecontaining the entire or partial coding sequence of the human Hairlessgene may be selected from genomic DNA or a library thereof. Specificbinding molecules to the human Hairless gene may also be derived.

Chromosome Localization

The plasmid containing the 3.5 Kb cDNA insert (clone 37MC1) of the humanhomolog of the mouse hairless gene was nick-translated with biotin-14dATP (Gibco-BRL), with 20% incorporation as determined by tritium tracerincorporation. Slides with chromosome spreads were made from normal malelymphocytes cultured with BrdU (Bhatt et al., Nucleic Acids Res. 16,3951-3961, 1988). Fluorescence in situ hybridization was performed asdescribed (Lichter et al., Science 247, 64-69, 1990) with modifications.Probe mix (2×SSCP, 50% formamide, 10% dextran sulfate, 20 ng/μlbiotinylated probe, and 200 g/μl salmon sperm DNA) was denatured at 70°C. for 5 minutes, quickly chilled on ice, placed on slides andhybridized at 37° C. overnight. Slides were washed in 50%formamide/2×SSC at 37° C. for 20 minutes, and two changes of 2×SSC at37° C. for 5 minutes each. Biotinylated probe was detected withFITC-avidin and amplified with biotinylated anti-avidin, using an insitu hybridization kit (Oncor) and manufacturer's instructions.

Analysis of 109 metaphase cells showed 20 cells (18%) had at least onepair of signals (involving both chromatids of a single chromosome).These 20 metaphases were photographed on color slide film (KodakEktachrome 400HC); 25 paired signals were seen. Of these, 22 (96%) werelocated on the p arm of a small C-group (chr. 8-12) chromosome and threewere on other chromosomes (no other site had more than one signal). Todetermine the specific chromosome and band location of the signals, thehybridized slid was G-banded by FPG (fluorescence plus Giemsa),photographed and aligned with the color slides to determine subbandlocation. Seventeen signals were analyzed after banding (an additionalsix signals were on a small C-group chromosome which was probablychromosome 8, but it could not be definitively identified, due toinadequate banding in those metaphases): 14 were on 8p12-21 and theremaining three signals were on different chromosomes (one each on 9p13,14q12, 7q22).

These results demonstrate that the invention may be used as a marker forthe chromosomal locus of the human Hairless gene. Other human genes maybe mapped with respect to the locus; large-scale genetic alterations maybe detected (e.g., amplification, duplication, deletion, inversion,translocation) and isolated by microdissection in the vicinity of thelocus. This is another example of specific hybridization and washingconditions.

While the present invention has been described in connection with whatis presently considered to be practical and preferred embodiments, it isunderstood that the present invention is not to be limited or restrictedto the disclosed embodiments but, on the contrary, is intended to covervarious modifications and equivalent arrangements included within thescope of the appended claims.

Thus, it is to be understood that variations in the described inventionwill be obvious to those skilled in the art without departing from thenovel and non-obvious aspects of the present invention, and suchvariations are intended to come within the scope of the claims below.

10 1 3202 DNA Homo sapiens 1 ttttactaca aggatccgag cattcccagg ttggcaaaggagcccttggc agctgcggaa 60 cctgggttgt ttggcttaaa ctctggtggg cacctgcagagagccgggga ggccgaacgc 120 ccttcactgc accagaggga tggagagatg ggagctggccggcagcagaa tccttgcccg 180 ctcttcctgg ggcagccaga cactgtgccc tggacctcctggcccgcttg tcccccaggc 240 cttgttcata ctcttggcaa cgtctgggct gggccaggcgatgggaacct tgggtaccag 300 ctggggccac cagcaacacc aaggtgcccc tctcctgagccgcctgtcac ccagcggggc 360 tgctgttcat cctacccacc cactaaaggt ggggatcttggcccttgtgg gaagtgccag 420 gagggcctgg aggggggtgc cagtggagcc agcgaacccagcgaggaagt gaacaaggcc 480 tctggcccca gggcctgtcc ccccagccac cacaccaagctgaagaagac atggctcaca 540 cggcactcgg agcagtttga atgtccacgc ggctgccctgaggtcgagga gaggccggtt 600 gctcggctcc gggccctcaa aagggcaggc agccccgaggtccagggagc aatgggcagt 660 ccagccccca agcggccacc ggaccctttc ccaggcactgcagaacaggg ggctgggggt 720 tggcaggagg ttcgggacac atcgataggg aacaaggatgtggactcggg acagcatgat 780 gagcagaaag gaccccaaga tggccaggcc agtctccaggacccgggact tcaggacata 840 ccatgcctgc ttctccctgc aaaactggct caatgccaaagttgtgccca ggcagctgga 900 gagggaggag ggcacgcctg ccactctcag caagtgcggagatcgcctct gggaggggag 960 ctgcagcagg aggaagacac agccaccaac tccagctctgaggaaggccc agggtccggc 1020 cctgacagcc ggctcagcac aggcctcgcc aagcacctgctcagtggttt gggggaccga 1080 ctgtgccgcc tgctgcggag ggagcgggag gccctggcttgggcccagcg ggaaggccaa 1140 gggccagccg tgacagggga cagcccaggc attccacgctgctgcagccg ttgccaccat 1200 ggactcttca acacccactg gcgatgtccc cgctgcagccaccggctgtg tgtggcctgt 1260 ggtcgtgtgg caggcactgg gcgggccagg gagaaagcaggctttcagga gcagtccgcg 1320 gaggagtgca cgcaggaggc cgggcacgct gcctgttccctgatgctgac ccagtttgtc 1380 tccagccagg ctttggcaga gctgagcact gcaatgcaccaggtctgggt caagtttgat 1440 atccgggggc actgcccctg ccaagctgat gcccgggtatgggcccccgg ggatgcaggc 1500 cagcagaagg aatcaacaca gaaaacgccc ccaactccacaaccttcctg caatggcgac 1560 acccacagga ccaagagcat caaagaggag acccccgattccgctgagac cccagcagag 1620 gaccgtgctg gccgagggcc cctgccttgt ccttctctctgcgaactgct ggcttctacc 1680 gcggtcaaac tctgcttggg ccatgagcga atacacatggccttcgcccc cgtcactccg 1740 gccctgccca gtgatgaccg catcaccaac atcctggacagcattatcgc acaggtggtg 1800 gaacggaaga tccaggagaa agccctgggg ccggggcttcgagctggccc gggtctgcgc 1860 aagggcctgg gcctgcccct ctctccagtg cggccccggctgcctccccc aggggctttg 1920 ctgtggctgc aggagcccca gccttgccct cggcgtggcttccacctctt ccaggagcac 1980 tggaggcagg gccagcctgt gttggtgtca gggatccaaaggacattgca gggcaacctg 2040 tgggggacag aagctcttgg ggcacttgga ggccaggtgcaggcgctgag ccccctcgga 2100 cctccccagc ccagcagcct gggcagcaca acattctgggagggcttctc ctggcctgag 2160 cttcgcccaa agtcagacga gggctctgtc ctcctgctgcaccgagcttt gggggatgag 2220 gacaccagca gggtggagaa cctagctgcc agtctgccacttccggagta ctgcgccctc 2280 catggaaaac tcaacctggc ttcctacctc ccaccgggccttgccctgcg tccactggag 2340 ccccagctct gggcagccta tggtgtgagc ccgcaccggggacacctggg gaccaagaac 2400 ctctgtgtgg aggtggccga cctggtcagc atcctggtgcatgccgacac accactgcct 2460 gcctggcacc gggcacagaa agacttcctt tcaggcctggacggggaggg gctctggtct 2520 ccgggcagcc aggtcagcac tgtgtggcac gtgttccgggcacaggacgc ccagcgcatc 2580 cgccgctttc tccagatggt gtgcccggcc ggggcaggcgccctggagcc tggcgcccca 2640 ggcagctgct acctggatgc agggctgcgg cggcgcctgcgggaggagtg gggcgtgagc 2700 tgctggaccc tgctccaggc ccccggagag gccgtgctggtgcctgcagg ggctccccac 2760 caggtgcagg gcctggtgag cacagtcagc gtcactcagcacttcctctc ccctgagacc 2820 tctgccctct ctgctcagct ctgccaccag ggacccagccttccccctga ctgccacctg 2880 ctttatgccc agatggactg ggctgtgttc caagcagtgaaggtggccgt ggggacatta 2940 caggaggcca aatagaggga tgctaggtgt ctgggatcggggtggggaca ggtagaccag 3000 gtgctcagcc caggcacaac ttcagcaggg gatggcgctaggggacttgg ggatttctgg 3060 tcaaccccac aagcaccact ctgggcacaa gcagggcactctgttcccct cccccttaag 3120 ccaacaacca cagtgccacc aagctcacac ctgtccttctcaggctggca tctcccccac 3180 cctgtgccct tttatgtaca gg 3202 2 984 PRT Homosapiens 2 Phe Tyr Tyr Lys Asp Pro Ser Ile Pro Arg Leu Ala Lys Glu ProLeu 1 5 10 15 Ala Ala Ala Glu Pro Gly Leu Phe Gly Leu Asn Ser Gly GlyHis Leu 20 25 30 Gln Arg Ala Gly Glu Ala Glu Arg Pro Ser Leu His Gln ArgAsp Gly 35 40 45 Glu Met Gly Ala Gly Arg Gln Gln Asn Pro Cys Pro Leu PheLeu Gly 50 55 60 Gln Pro Asp Thr Val Pro Trp Thr Ser Trp Pro Ala Cys ProPro Gly 65 70 75 80 Leu Val His Thr Leu Gly Asn Val Trp Ala Gly Pro GlyAsp Gly Asn 85 90 95 Leu Gly Tyr Gln Leu Gly Pro Pro Ala Thr Pro Arg CysPro Ser Pro 100 105 110 Glu Pro Pro Val Thr Gln Arg Gly Cys Cys Ser SerTyr Pro Pro Thr 115 120 125 Lys Gly Gly Asp Leu Gly Pro Cys Gly Lys CysGln Glu Gly Leu Glu 130 135 140 Gly Gly Ala Ser Gly Ala Ser Glu Pro SerGlu Glu Val Asn Lys Ala 145 150 155 160 Ser Gly Pro Arg Ala Cys Pro ProSer His His Thr Lys Leu Lys Lys 165 170 175 Thr Trp Leu Thr Arg His SerGlu Gln Phe Glu Cys Pro Arg Gly Cys 180 185 190 Pro Glu Val Glu Glu ArgPro Val Ala Arg Leu Arg Ala Leu Lys Arg 195 200 205 Ala Gly Ser Pro GluVal Gln Gly Ala Met Gly Ser Pro Ala Pro Lys 210 215 220 Arg Pro Pro AspPro Phe Pro Gly Thr Ala Glu Gln Gly Ala Gly Gly 225 230 235 240 Trp GlnGlu Val Arg Asp Thr Ser Ile Gly Asn Lys Asp Val Asp Ser 245 250 255 GlyGln His Asp Glu Gln Lys Gly Pro Gln Asp Gly Gln Ala Ser Leu 260 265 270Gln Asp Pro Gly Leu Gln Asp Ile Pro Cys Leu Ala Leu Pro Ala Lys 275 280285 Leu Ala Gln Cys Gln Ser Cys Ala Gln Ala Ala Gly Glu Gly Gly Gly 290295 300 His Ala Cys His Ser Gln Gln Val Arg Arg Ser Pro Leu Gly Gly Glu305 310 315 320 Leu Gln Gln Glu Glu Asp Thr Ala Thr Asn Ser Ser Ser GluGlu Gly 325 330 335 Pro Gly Ser Gly Pro Asp Ser Arg Leu Ser Thr Gly LeuAla Lys His 340 345 350 Leu Leu Ser Gly Leu Gly Asp Arg Leu Cys Arg LeuLeu Arg Arg Glu 355 360 365 Arg Glu Ala Leu Ala Trp Ala Gln Arg Glu GlyGln Gly Pro Ala Val 370 375 380 Thr Gly Asp Ser Pro Gly Ile Pro Arg CysCys Ser Arg Cys His His 385 390 395 400 Gly Leu Phe Asn Thr His Trp ArgCys Pro Arg Cys Ser His Arg Leu 405 410 415 Cys Val Ala Cys Gly Arg ValAla Gly Thr Gly Arg Ala Arg Glu Lys 420 425 430 Ala Gly Phe Gln Glu GlnSer Ala Glu Glu Cys Thr Gln Glu Ala Gly 435 440 445 His Ala Ala Cys SerLeu Met Leu Thr Gln Phe Val Ser Ser Gln Ala 450 455 460 Leu Ala Glu LeuSer Thr Ala Met His Gln Val Trp Val Lys Phe Asp 465 470 475 480 Ile ArgGly His Cys Pro Cys Gln Ala Asp Ala Arg Val Trp Ala Pro 485 490 495 GlyAsp Ala Gly Gln Gln Lys Glu Ser Thr Gln Lys Thr Pro Pro Thr 500 505 510Pro Gln Pro Ser Cys Asn Gly Asp Thr His Arg Thr Lys Ser Ile Lys 515 520525 Glu Glu Thr Pro Asp Ser Ala Glu Thr Pro Ala Glu Asp Arg Ala Gly 530535 540 Arg Gly Pro Leu Pro Cys Pro Ser Leu Cys Glu Leu Leu Ala Ser Thr545 550 555 560 Ala Val Lys Leu Cys Leu Gly His Glu Arg Ile His Met AlaPhe Ala 565 570 575 Pro Val Thr Pro Ala Leu Pro Ser Asp Asp Arg Ile ThrAsn Ile Leu 580 585 590 Asp Ser Ile Ile Ala Gln Val Val Glu Arg Lys IleGln Glu Lys Ala 595 600 605 Leu Gly Pro Gly Leu Arg Ala Gly Pro Gly LeuArg Lys Gly Leu Gly 610 615 620 Leu Pro Leu Ser Pro Val Arg Pro Arg LeuPro Pro Pro Gly Ala Leu 625 630 635 640 Leu Trp Leu Gln Glu Pro Gln ProCys Pro Arg Arg Gly Phe His Leu 645 650 655 Phe Gln Glu His Trp Arg GlnGly Gln Pro Val Leu Val Ser Gly Ile 660 665 670 Gln Arg Thr Leu Gln GlyAsn Leu Trp Gly Thr Glu Ala Leu Gly Ala 675 680 685 Leu Gly Gly Gln ValGln Ala Leu Ser Pro Leu Gly Pro Pro Gln Pro 690 695 700 Ser Ser Leu GlySer Thr Thr Phe Trp Glu Gly Phe Ser Trp Pro Glu 705 710 715 720 Leu ArgPro Lys Ser Asp Glu Gly Ser Val Leu Leu Leu His Arg Ala 725 730 735 LeuGly Asp Glu Asp Thr Ser Arg Val Glu Asn Leu Ala Ala Ser Leu 740 745 750Pro Leu Pro Glu Tyr Cys Ala Leu His Gly Lys Leu Asn Leu Ala Ser 755 760765 Tyr Leu Pro Pro Gly Leu Ala Leu Arg Pro Leu Glu Pro Gln Leu Trp 770775 780 Ala Ala Tyr Gly Val Ser Pro His Arg Gly His Leu Gly Thr Lys Asn785 790 795 800 Leu Cys Val Glu Val Ala Asp Leu Val Ser Ile Leu Val HisAla Asp 805 810 815 Thr Pro Leu Pro Ala Trp His Arg Ala Gln Lys Asp PheLeu Ser Gly 820 825 830 Leu Asp Gly Glu Gly Leu Trp Ser Pro Gly Ser GlnVal Ser Thr Val 835 840 845 Trp His Val Phe Arg Ala Gln Asp Ala Gln ArgIle Arg Arg Phe Leu 850 855 860 Gln Met Val Cys Pro Ala Gly Ala Gly AlaLeu Glu Pro Gly Ala Pro 865 870 875 880 Gly Ser Cys Tyr Leu Asp Ala GlyLeu Arg Arg Arg Leu Arg Glu Glu 885 890 895 Trp Gly Val Ser Cys Trp ThrLeu Leu Gln Ala Pro Gly Glu Ala Val 900 905 910 Leu Val Pro Ala Gly AlaPro His Gln Val Gln Gly Leu Val Ser Thr 915 920 925 Val Ser Val Thr GlnHis Phe Leu Ser Pro Glu Thr Ser Ala Leu Ser 930 935 940 Ala Gln Leu CysHis Gln Gly Pro Ser Leu Pro Pro Asp Cys His Leu 945 950 955 960 Leu TyrAla Gln Met Asp Trp Ala Val Phe Gln Ala Val Lys Val Ala 965 970 975 ValGly Thr Leu Gln Glu Ala Lys 980 3 1189 PRT Homo sapiens 3 Met Glu SerThr Pro Ser Phe Leu Lys Gly Thr Pro Thr Trp Glu Lys 1 5 10 15 Thr AlaPro Glu Asn Gly Ile Val Arg Gln Glu Pro Gly Ser Pro Pro 20 25 30 Arg AspGly Leu His His Gly Pro Leu Cys Leu Gly Glu Pro Ala Pro 35 40 45 Phe TrpArg Gly Val Leu Ser Thr Pro Asp Ser Trp Leu Pro Pro Gly 50 55 60 Phe ProGln Gly Pro Lys Asp Met Leu Pro Leu Val Glu Gly Glu Gly 65 70 75 80 ProGln Asn Gly Glu Arg Lys Val Asn Trp Leu Gly Ser Lys Glu Gly 85 90 95 LeuArg Trp Lys Glu Ala Met Leu Thr His Pro Leu Ala Phe Cys Gly 100 105 110Pro Ala Cys Pro Pro Arg Cys Gly Pro Leu Met Pro Glu His Ser Gly 115 120125 Gly His Leu Lys Ser Asp Pro Val Ala Phe Arg Pro Trp His Cys Pro 130135 140 Phe Leu Leu Glu Thr Lys Ile Leu Glu Arg Ala Pro Phe Trp Val Pro145 150 155 160 Thr Cys Leu Pro Pro Tyr Leu Val Ser Gly Leu Pro Pro GluHis Pro 165 170 175 Cys Asp Trp Pro Leu Thr Pro His Pro Trp Val Tyr SerGly Gly Gln 180 185 190 Pro Lys Val Pro Ser Ala Phe Ser Leu Gly Ser LysGly Phe Tyr Tyr 195 200 205 Lys Asp Pro Ser Ile Pro Arg Leu Ala Lys GluPro Leu Ala Ala Ala 210 215 220 Glu Pro Gly Leu Phe Gly Leu Asn Ser GlyGly His Leu Gln Arg Ala 225 230 235 240 Gly Glu Ala Glu Arg Pro Ser LeuHis Gln Arg Asp Gly Glu Met Gly 245 250 255 Ala Gly Arg Gln Gln Asn ProCys Pro Leu Phe Leu Gly Gln Pro Asp 260 265 270 Thr Val Pro Trp Thr SerTrp Pro Ala Cys Pro Pro Gly Leu Val His 275 280 285 Thr Leu Gly Asn ValTrp Ala Gly Pro Gly Asp Gly Asn Leu Gly Tyr 290 295 300 Gln Leu Gly ProPro Ala Thr Pro Arg Cys Pro Ser Pro Glu Pro Pro 305 310 315 320 Val ThrGln Arg Gly Cys Cys Ser Ser Tyr Pro Pro Thr Lys Gly Gly 325 330 335 AspLeu Gly Pro Cys Gly Lys Cys Gln Glu Gly Leu Glu Gly Gly Ala 340 345 350Ser Gly Ala Ser Glu Pro Ser Glu Glu Val Asn Lys Ala Ser Gly Pro 355 360365 Arg Ala Cys Pro Pro Ser His His Thr Lys Leu Lys Lys Thr Trp Leu 370375 380 Thr Arg His Ser Glu Gln Phe Glu Cys Pro Arg Gly Cys Pro Glu Val385 390 395 400 Glu Glu Arg Pro Val Ala Arg Leu Arg Ala Leu Lys Arg AlaGly Ser 405 410 415 Pro Glu Val Gln Gly Ala Met Gly Ser Pro Ala Pro LysArg Pro Pro 420 425 430 Asp Pro Phe Pro Gly Thr Ala Glu Gln Gly Ala GlyGly Leu Gln Glu 435 440 445 Val Arg Asp Thr Ser Ile Gly Asn Lys Asp ValAsp Ser Gly Gln His 450 455 460 Asp Glu Gln Lys Gly Pro Gln Asp Gly GlnAla Ser Leu Gln Asp Pro 465 470 475 480 Gly Leu Gln Asp Ile Pro Cys LeuAla Leu Pro Ala Lys Leu Ala Gln 485 490 495 Cys Gln Ser Cys Ala Gln AlaAla Gly Glu Gly Gly Gly His Ala Cys 500 505 510 His Ser Gln Gln Val ArgArg Ser Pro Leu Gly Gly Glu Leu Gln Gln 515 520 525 Glu Glu Asp Thr AlaThr Asn Ser Ser Ser Glu Glu Gly Pro Gly Ser 530 535 540 Gly Pro Asp SerArg Leu Ser Thr Gly Leu Ala Lys His Leu Leu Ser 545 550 555 560 Gly LeuGly Asp Arg Leu Cys Arg Leu Leu Arg Gly Glu Arg Glu Ala 565 570 575 LeuAla Trp Ala Gln Arg Glu Ser Gln Gly Pro Ala Val Thr Glu Asp 580 585 590Ser Pro Gly Ile Pro Arg Cys Cys Ser Arg Cys His His Gly Leu Phe 595 600605 Asn Thr His Trp Arg Cys Pro Arg Cys Ser His Arg Leu Cys Val Ala 610615 620 Cys Gly Arg Val Ala Gly Thr Gly Arg Ala Arg Glu Lys Ala Gly Phe625 630 635 640 Gln Glu Gln Ser Ala Glu Glu Cys Thr Gln Glu Ala Gly HisAla Ala 645 650 655 Cys Ser Leu Met Leu Thr Gln Phe Val Ser Ser Gln AlaLeu Ala Glu 660 665 670 Leu Ser Thr Ala Met His Gln Val Trp Val Lys PheAsp Ile Arg Gly 675 680 685 His Cys Pro Cys Gln Ala Asp Ala Arg Val TrpAla Pro Gly Asp Ala 690 695 700 Gly Gln Gln Lys Glu Ser Thr Gln Lys ThrPro Pro Thr Pro Gln Pro 705 710 715 720 Ser Cys Asn Gly Asp Thr His ArgThr Lys Ser Ile Lys Glu Glu Thr 725 730 735 Pro Asp Ser Ala Glu Thr ProAla Glu Asp Arg Ala Gly Arg Gly Pro 740 745 750 Leu Pro Cys Pro Ser LeuCys Glu Leu Leu Ala Ser Thr Ala Val Lys 755 760 765 Leu Cys Leu Gly HisAsp Arg Ile His Met Ala Phe Ala Pro Val Thr 770 775 780 Pro Ala Leu ProSer Asp Asp Arg Ile Thr Asn Ile Leu Asp Ser Ile 785 790 795 800 Ile AlaGln Val Val Glu Arg Lys Ile Gln Glu Lys Ala Leu Gly Pro 805 810 815 GlyLeu Arg Ala Gly Pro Gly Leu Arg Lys Gly Leu Gly Leu Pro Leu 820 825 830Ser Pro Val Arg Pro Arg Leu Pro Pro Pro Gly Ala Leu Leu Trp Leu 835 840845 Gln Glu Pro Gln Pro Cys Pro Arg Arg Gly Phe His Leu Phe Gln Glu 850855 860 His Trp Arg Gln Gly Gln Pro Val Leu Val Ser Gly Ile Gln Arg Thr865 870 875 880 Leu Gln Gly Asn Leu Trp Gly Thr Glu Ala Leu Gly Ala LeuGly Gly 885 890 895 Gln Val Gln Ala Leu Ser Pro Leu Ala Pro Pro Gln ProSer Ser Leu 900 905 910 Gly Ser Thr Thr Phe Trp Glu Gly Phe Ser Trp ProGlu Leu Arg Pro 915 920 925 Lys Ser Asp Glu Gly Ser Val Leu Leu Leu HisArg Ala Phe Gly Asp 930 935 940 Glu Asp Thr Ser Arg Val Glu Asn Leu AlaAla Ser Leu Pro Leu Pro 945 950 955 960 Glu Tyr Cys Ala Leu His Gly LysLeu Asn Leu Ala Ser Tyr Leu Pro 965 970 975 Pro Gly Leu Ala Leu Arg ProLeu Glu Pro Gln Leu Trp Ala Ala Tyr 980 985 990 Gly Val Ser Pro His ArgGly His Leu Gly Thr Lys Asn Leu Cys Val 995 1000 1005 Glu Val Ala AspLeu Val Ser Ile Leu Val His Ala Arg Thr Pro Leu 1010 1015 1020 Pro AlaTrp His Glu Ala Gln Lys Asp Phe Leu Ser Gly Leu Asp Gly 1025 1030 10351040 Glu Gly Leu Trp Ser Pro Gly Ser Gln Val Ser Thr Val Trp His Val1045 1050 1055 Phe Arg Ala Gln Asp Ala Gln Arg Ile Arg Arg Phe Leu GlnMet Val 1060 1065 1070 Cys Pro Ala Gly Ala Gly Ala Leu Glu Pro Gly AlaPro Gly Ser Cys 1075 1080 1085 Tyr Leu Asp Ala Gly Leu Arg Arg Arg LeuArg Glu Glu Trp Gly Val 1090 1095 1100 Ser Cys Trp Thr Leu Leu Gln AlaPro Gly Glu Ala Val Leu Val Pro 1105 1110 1115 1120 Ala Gly Ala Pro HisGln Val Gln Gly Leu Val Ser Thr Val Ser Val 1125 1130 1135 Thr Gln HisPhe Leu Ser Pro Glu Thr Ser Ala Leu Ser Ala Gln Leu 1140 1145 1150 CysHis Gln Gly Ala Ser Leu Pro Pro Asp Cys His Leu Leu Tyr Ala 1155 11601165 Gln Met Asp Trp Ala Val Phe Gln Ala Val Lys Val Ala Val Gly Thr1170 1175 1180 Leu Gln Glu Ala Lys 1185 4 1189 PRT Homo sapiens 4 MetGlu Ser Thr Pro Ser Phe Leu Lys Gly Thr Pro Thr Trp Glu Lys 1 5 10 15Thr Ala Pro Glu Asn Gly Ile Val Arg Gln Glu Pro Gly Ser Pro Pro 20 25 30Arg Asp Gly Leu His His Gly Pro Leu Cys Leu Gly Glu Pro Ala Pro 35 40 45Phe Trp Arg Gly Val Leu Ser Thr Pro Asp Ser Trp Leu Pro Pro Gly 50 55 60Phe Pro Gln Gly Pro Lys Asp Met Leu Pro Leu Val Glu Gly Glu Gly 65 70 7580 Pro Gln Asn Gly Glu Arg Lys Val Asn Trp Leu Gly Ser Lys Glu Gly 85 9095 Leu Arg Trp Lys Glu Ala Met Leu Thr His Pro Leu Ala Phe Cys Gly 100105 110 Pro Ala Cys Pro Pro Arg Cys Gly Pro Leu Met Pro Glu His Ser Gly115 120 125 Gly His Leu Lys Ser Asp Pro Val Ala Phe Arg Pro Trp His CysPro 130 135 140 Phe Leu Leu Glu Thr Lys Ile Leu Glu Arg Ala Pro Phe TrpVal Pro 145 150 155 160 Thr Cys Leu Pro Pro Tyr Leu Val Ser Gly Leu ProPro Glu His Pro 165 170 175 Cys Asp Trp Pro Leu Thr Pro His Pro Trp ValTyr Ser Gly Gly Gln 180 185 190 Pro Lys Val Pro Ser Ala Phe Ser Leu GlySer Lys Gly Phe Tyr Tyr 195 200 205 Lys Asp Pro Ser Ile Pro Arg Leu AlaLys Glu Pro Leu Ala Ala Ala 210 215 220 Glu Pro Gly Leu Phe Gly Leu AsnSer Gly Gly His Leu Gln Arg Ala 225 230 235 240 Gly Glu Ala Glu Arg ProSer Leu His Gln Arg Asp Gly Glu Met Gly 245 250 255 Ala Gly Arg Gln GlnAsn Pro Cys Pro Leu Phe Leu Gly Gln Pro Asp 260 265 270 Thr Val Pro TrpThr Ser Trp Pro Ala Cys Pro Pro Gly Leu Val His 275 280 285 Thr Leu GlyAsn Val Trp Ala Gly Pro Gly Asp Gly Asn Leu Gly Tyr 290 295 300 Gln LeuGly Pro Pro Ala Thr Pro Arg Cys Pro Ser Pro Glu Pro Pro 305 310 315 320Val Thr Gln Arg Gly Cys Cys Ser Ser Tyr Pro Pro Thr Lys Gly Gly 325 330335 Gly Leu Gly Pro Cys Gly Lys Cys Gln Glu Gly Leu Glu Gly Gly Ala 340345 350 Ser Gly Ala Ser Glu Pro Ser Glu Glu Val Asn Lys Ala Ser Gly Pro355 360 365 Arg Ala Cys Pro Pro Ser His His Thr Lys Leu Lys Lys Thr TrpLeu 370 375 380 Thr Arg His Ser Glu Gln Phe Glu Cys Pro Arg Gly Cys ProGlu Val 385 390 395 400 Glu Glu Arg Pro Val Ala Arg Leu Arg Ala Leu LysArg Ala Gly Ser 405 410 415 Pro Glu Val Gln Gly Ala Met Gly Ser Pro AlaPro Lys Arg Pro Pro 420 425 430 Asp Pro Phe Pro Gly Thr Ala Glu Gln GlyAla Gly Gly Trp Gln Glu 435 440 445 Val Arg Asp Thr Ser Ile Gly Asn LysAsp Val Asp Ser Gly Gln His 450 455 460 Asp Glu Gln Lys Gly Pro Gln AspGly Gln Ala Ser Leu Gln Asp Pro 465 470 475 480 Gly Leu Gln Asp Ile ProCys Leu Ala Leu Pro Ala Lys Leu Ala Gln 485 490 495 Cys Gln Ser Cys AlaGln Ala Ala Gly Glu Gly Gly Gly His Ala Cys 500 505 510 His Ser Gln GlnVal Arg Arg Ser Pro Leu Gly Gly Glu Leu Gln Gln 515 520 525 Glu Glu AspThr Ala Thr Asn Ser Ser Ser Glu Glu Gly Pro Gly Ser 530 535 540 Gly ProAsp Ser Arg Leu Ser Thr Gly Leu Ala Lys His Leu Leu Ser 545 550 555 560Gly Leu Gly Asp Arg Leu Cys Arg Leu Leu Arg Arg Glu Arg Glu Ala 565 570575 Leu Ala Trp Ala Gln Arg Glu Gly Gln Gly Pro Ala Val Thr Glu Asp 580585 590 Ser Pro Gly Ile Pro Arg Cys Cys Ser Arg Cys His His Gly Leu Phe595 600 605 Asn Thr His Trp Arg Cys Pro Arg Cys Ser His Arg Leu Cys ValAla 610 615 620 Cys Gly Arg Val Ala Gly Thr Gly Arg Ala Arg Glu Lys AlaGly Phe 625 630 635 640 Gln Glu Gln Ser Ala Glu Glu Cys Thr Gln Glu AlaGly His Ala Ala 645 650 655 Cys Ser Leu Met Leu Thr Gln Phe Val Ser SerGln Ala Leu Ala Glu 660 665 670 Leu Ser Thr Ala Met His Gln Val Trp ValLys Phe Asp Ile Arg Gly 675 680 685 His Cys Pro Cys Gln Ala Asp Ala ArgVal Trp Ala Pro Gly Asp Ala 690 695 700 Gly Gln Gln Lys Glu Ser Thr GlnLys Thr Pro Pro Thr Pro Gln Pro 705 710 715 720 Ser Cys Asn Gly Asp ThrHis Arg Thr Lys Ser Ile Lys Glu Glu Thr 725 730 735 Pro Asp Ser Ala GluThr Pro Ala Glu Asp Arg Ala Gly Arg Gly Pro 740 745 750 Leu Pro Cys ProSer Leu Cys Glu Leu Leu Ala Ser Thr Ala Val Lys 755 760 765 Leu Cys LeuGly His Glu Arg Ile His Met Ala Phe Ala Pro Val Thr 770 775 780 Pro AlaLeu Pro Ser Asp Asp Arg Ile Thr Asn Ile Leu Asp Ser Ile 785 790 795 800Ile Ala Gln Val Val Glu Arg Lys Ile Gln Glu Lys Ala Leu Gly Pro 805 810815 Gly Leu Arg Ala Gly Pro Gly Leu Arg Lys Gly Leu Gly Leu Pro Leu 820825 830 Ser Pro Val Arg Pro Arg Leu Pro Pro Pro Gly Ala Leu Leu Trp Leu835 840 845 Gln Glu Pro Gln Pro Cys Pro Arg Arg Gly Phe His Leu Phe GlnGlu 850 855 860 His Trp Arg Gln Gly Gln Pro Val Leu Val Ser Gly Ile GlnArg Thr 865 870 875 880 Leu Gln Gly Asn Leu Trp Gly Thr Glu Ala Leu GlyAla Leu Gly Gly 885 890 895 Gln Val Gln Ala Leu Ser Pro Leu Gly Pro ProGln Pro Ser Ser Leu 900 905 910 Gly Ser Thr Thr Phe Trp Glu Gly Phe SerTrp Pro Glu Leu Arg Pro 915 920 925 Lys Ser Asp Glu Gly Ser Val Leu LeuLeu His Arg Ala Leu Gly Asp 930 935 940 Glu Asp Thr Ser Arg Val Glu AsnLeu Ala Ala Ser Leu Pro Leu Pro 945 950 955 960 Glu Tyr Cys Ala Leu HisGly Lys Leu Asn Leu Ala Ser Tyr Leu Pro 965 970 975 Pro Gly Leu Ala LeuArg Pro Leu Glu Pro Gln Leu Trp Ala Ala Tyr 980 985 990 Gly Val Ser ProHis Arg Gly His Leu Gly Thr Lys Asn Leu Cys Val 995 1000 1005 Glu ValAla Asp Leu Val Ser Ile Leu Val His Ala Asp Thr Pro Leu 1010 1015 1020Pro Ala Trp His Arg Ala Gln Lys Asp Phe Leu Ser Gly Leu Asp Gly 10251030 1035 1040 Glu Gly Leu Trp Ser Pro Gly Ser Gln Val Ser Thr Val TrpHis Val 1045 1050 1055 Phe Arg Ala Gln Asp Ala Gln Arg Ile Arg Arg PheLeu Gln Met Val 1060 1065 1070 Cys Pro Ala Gly Ala Gly Ala Leu Glu ProGly Ala Pro Gly Ser Cys 1075 1080 1085 Tyr Leu Asp Ala Gly Leu Arg ArgArg Leu Arg Glu Glu Trp Gly Val 1090 1095 1100 Ser Cys Trp Thr Leu LeuGln Ala Pro Gly Glu Ala Val Leu Val Pro 1105 1110 1115 1120 Ala Gly AlaPro His Gln Val Gln Gly Leu Val Ser Thr Val Ser Val 1125 1130 1135 ThrGln His Phe Leu Ser Pro Glu Thr Ser Ala Leu Ser Ala Gln Leu 1140 11451150 Cys His Gln Gly Pro Ser Leu Pro Pro Asp Cys His Leu Leu Tyr Ala1155 1160 1165 Gln Met Asp Trp Ala Val Phe Gln Ala Val Lys Val Ala ValGly Thr 1170 1175 1180 Leu Gln Glu Ala Lys 1185 5 1207 PRT Rat 5 Met GlyLeu Arg Ser Ser Cys Phe Val Leu Thr Leu Gln Asp Pro Pro 1 5 10 15 LeuGly Glu Pro His Glu Gly Arg Arg Val Met Glu Ser Met Pro Ser 20 25 30 PheLeu Lys Asp Thr Pro Ala Trp Glu Lys Thr Ala Pro Val Asn Gly 35 40 45 IleVal Gly Gln Glu Pro Gly Thr Ser Pro Gln Asp Gly Leu His His 50 55 60 GlyAla Leu Cys Leu Gly Glu Pro Val Pro Phe Trp Arg Gly Val Leu 65 70 75 80Ser Ala Pro Asp Ser Trp Leu Pro Pro Gly Phe Leu Gln Gly Pro Lys 85 90 95Asp Thr Leu Ser Val Val Glu Gly Glu Gly Ser Arg Asn Gly Glu Arg 100 105110 Lys Ala Asn Trp Leu Gly Ser Lys Glu Gly Leu Arg Trp Lys Glu Ala 115120 125 Met Leu Ala His Pro Leu Ala Phe Cys Gly Pro Ala Cys Pro Pro Arg130 135 140 Tyr Gly Pro Leu Ile Pro Glu His Ser Ser Gly His Pro Lys SerAsp 145 150 155 160 Pro Val Ala Phe Arg Pro Leu His Cys Pro Phe Leu LeuGlu Thr Lys 165 170 175 Ile Leu Glu Arg Ala Pro Phe Trp Val Pro Thr CysLeu Pro Pro Tyr 180 185 190 Leu Met Ser Ser Leu Pro Pro Glu Arg Ser TyrAsp Trp Pro Leu Ala 195 200 205 Pro Ser Pro Trp Val Tyr Ser Gly Ser GlnPro Lys Val Pro Ser Ala 210 215 220 Phe Ser Leu Gly Ser Lys Gly Phe TyrHis Lys Asp Pro Asn Ile Leu 225 230 235 240 Arg Pro Ala Lys Glu Pro LeuAla Ala Ser Glu Ser Gly Met Leu Gly 245 250 255 Leu Ala Pro Gly Gly HisLeu Gln Gln Ala Cys Asp Ala Glu Gly Pro 260 265 270 Ser Leu His Gln ArgAsp Gly Glu Thr Gly Ala Gly Arg Gln Gln Asn 275 280 285 Leu Cys Pro ValPhe Leu Gly Tyr Pro Asp Thr Val Pro Arg Thr Pro 290 295 300 Trp Pro SerCys Pro Pro Gly Leu Val His Thr Leu Gly Asn Val Trp 305 310 315 320 AlaGly Pro Gly Ser Asn Ser Phe Gly Tyr Gln Leu Gly Pro Pro Val 325 330 335Thr Pro Arg Cys Pro Ser Pro Gly Pro Pro Thr Pro Pro Gly Gly Cys 340 345350 Cys Ser Ser His Leu Pro Ala Arg Glu Gly Asp Pro Gly Pro Cys Arg 355360 365 Lys Cys Gln Asp Ser Pro Glu Gly Ser Ser Ser Gly Pro Gly Glu Ser370 375 380 Ser Glu Glu Arg Asn Lys Ala Gly Ser Arg Ala Ser Pro Pro SerHis 385 390 395 400 His Thr Lys Leu Lys Lys Thr Trp Leu Thr Arg His SerGlu Gln Phe 405 410 415 Glu Cys Pro Gly Gly Cys Pro Gly Lys Gly Glu SerPro Ala Thr Gly 420 425 430 Leu Arg Ala Leu Lys Arg Ala Gly Ser Pro GluVal Gln Gly Ala Arg 435 440 445 Gly Pro Ala Pro Lys Arg Pro Ser His ThrPhe Pro Gly Thr Gly Arg 450 455 460 Gln Gly Ala Arg Ala Trp Gln Glu ThrPro Glu Thr Ser Thr Gly Ser 465 470 475 480 Lys Ala Glu Ala Gln Gln GlnGlu Glu Gln Arg Gly Pro Arg Asp Gly 485 490 495 Arg Ile Arg Leu Arg GluSer Arg Leu Glu Asp Thr Ser Cys Gln His 500 505 510 His Leu Ala Gly ValThr Gln Cys Pro Ser Cys Val Gln Ala Ala Gly 515 520 525 Glu Val Glu IleLeu Thr Ser His Ser Gln Lys Ser His Lys Leu Pro 530 535 540 Leu Glu GluLys Pro Leu Glu Glu Asp Ser Cys Ala Thr Ser Glu Glu 545 550 555 560 GlyGly Gly Ser Ser Pro Glu Ala Ser Ile Asn Lys Gly Leu Ala Lys 565 570 575His Leu Leu Ser Gly Leu Gly Asp Arg Leu Cys Arg Leu Leu Arg Lys 580 585590 Glu Arg Glu Ala Leu Ala Trp Ala Gln Arg Glu Gly Gln Gly Pro Ala 595600 605 Met Thr Glu Asp Ser Pro Gly Ile Pro His Cys Cys Ser Arg Cys His610 615 620 His Gly Leu Phe Asn Thr His Trp Arg Cys Ser His Cys Ser HisArg 625 630 635 640 Leu Cys Val Ala Cys Gly Arg Ile Ala Gly Ala Gly LysAsn Arg Glu 645 650 655 Lys Thr Gly Ser Arg Glu Gln Arg Thr Asp Asp CysAla Gln Glu Ala 660 665 670 Gly His Ala Ala Cys Ser Leu Ile Leu Thr GlnPhe Val Ser Ser Gln 675 680 685 Ala Leu Ala Glu Leu Ser Thr Val Met HisGln Val Trp Ala Lys Phe 690 695 700 Asp Ile Arg Gly His Cys Phe Cys GlnVal Asp Ala Arg Val Trp Ala 705 710 715 720 Pro Gly Asp Gly Gly Gln GlnLys Glu Pro Thr Glu Lys Thr Pro Pro 725 730 735 Ala Pro Gln Leu Ser CysAsn Gly Asp Ser Asn Arg Thr Lys Asp Ile 740 745 750 Lys Glu Glu Thr ProAsp Ser Thr Glu Ser Pro Ala Glu Asp Arg Ala 755 760 765 Gly Arg Ser ProLeu Pro Cys Pro Ser Leu Cys Glu Leu Leu Ala Ser 770 775 780 Thr Ala ValLys Leu Cys Leu Gly His Glu Arg Ile His Met Ala Phe 785 790 795 800 AlaPro Val Thr Pro Ala Leu Pro Ser Asp Asp Arg Ile Thr Asn Ile 805 810 815Leu Asp Ser Ile Ile Ala Gln Val Val Glu Arg Lys Ile Gln Glu Lys 820 825830 Ala Leu Gly Pro Gly Leu Arg Ala Gly Ser Gly Leu Arg Lys Gly Leu 835840 845 Ser Leu Pro Leu Ser Pro Val Arg Thr Gln Leu Ser Pro Pro Gly Ala850 855 860 Leu Leu Trp Leu Gln Glu Pro Arg Pro Lys His Gly Phe Arg LeuPhe 865 870 875 880 Gln Glu His Trp Arg Gln Gly Gln Pro Val Leu Val SerGly Ile Gln 885 890 895 Lys Thr Leu Arg Leu Ser Leu Trp Gly Met Glu AlaLeu Gly Thr Leu 900 905 910 Gly Gly Gln Val Gln Thr Leu Thr Ala Leu GlyPro Pro Gln Pro Thr 915 920 925 Ser Leu Asp Ser Thr Ala Phe Trp Lys GlyPhe Ser His Pro Glu Ala 930 935 940 Arg Pro Lys Leu Asp Glu Gly Ser ValLeu Leu Leu His Arg Pro Leu 945 950 955 960 Gly Asp Lys Asp Glu Ser ArgVal Glu Asn Leu Ala Ser Ser Leu Pro 965 970 975 Leu Pro Glu Tyr Cys AlaHis Gln Gly Lys Leu Asn Leu Ala Ser Tyr 980 985 990 Leu Pro Leu Gly LeuThr Leu His Pro Leu Glu Pro Gln Leu Trp Ala 995 1000 1005 Ala Tyr GlyVal Asn Ser His Arg Gly His Leu Gly Thr Lys Asn Leu 1010 1015 1020 CysVal Glu Val Ser Asp Leu Ile Ser Ile Leu Val His Ala Glu Ala 1025 10301035 1040 Gln Leu Pro Pro Trp Tyr Arg Ala Gln Lys Asp Phe Leu Ser GlyLeu 1045 1050 1055 Asp Gly Glu Gly Leu Trp Ser Pro Gly Ser Gln Thr SerThr Val Trp 1060 1065 1070 His Val Phe Arg Ala Gln Asp Ala Gln Arg IleArg Arg Phe Leu Gln 1075 1080 1085 Met Val Cys Pro Ala Gly Ala Gly ThrLeu Glu Pro Gly Ala Pro Gly 1090 1095 1100 Ser Cys Tyr Leu Asp Ser GlyLeu Arg Arg Arg Leu Arg Glu Glu Trp 1105 1110 1115 1120 Gly Val Ser CysTrp Thr Leu Leu Gln Ala Pro Gly Glu Ala Val Leu 1125 1130 1135 Val ProAla Gly Ala Pro His Gln Val Gln Gly Leu Val Ser Thr Ile 1140 1145 1150Ser Val Thr Gln His Phe Leu Ser Pro Glu Thr Ser Ala Leu Ser Ala 11551160 1165 Gln Leu Cys His Gln Gly Ala Ser Leu Pro Pro Asp His Arg MetLeu 1170 1175 1180 Tyr Ala Gln Met Asp Arg Ala Val Phe Gln Ala Val LysVal Ala Val 1185 1190 1195 1200 Gly Thr Leu Gln Glu Ala Lys 1205 6 1182PRT Mouse 6 Met Glu Ser Met Pro Ser Phe Leu Lys Asp Thr Pro Ala Trp GluLys 1 5 10 15 Thr Ala Pro Val Asn Gly Ile Val Gly Gln Glu Pro Gly ThrSer Pro 20 25 30 Gln Asp Gly Leu Arg His Gly Ala Leu Cys Leu Gly Glu ProAla Pro 35 40 45 Phe Trp Arg Gly Val Leu Ser Thr Pro Asp Ser Trp Leu ProPro Gly 50 55 60 Phe Leu Gln Gly Pro Lys Asp Thr Leu Ser Leu Val Glu GlyGlu Gly 65 70 75 80 Pro Arg Asn Gly Glu Arg Lys Gly Ser Trp Leu Gly GlyLys Glu Gly 85 90 95 Leu Arg Trp Lys Glu Ala Met Leu Ala His Pro Leu AlaPhe Cys Gly 100 105 110 Pro Ala Cys Pro Pro Arg Tyr Gly Pro Leu Ile ProGlu His Ser Gly 115 120 125 Gly His Pro Lys Ser Asp Pro Val Ala Phe ArgPro Leu His Cys Pro 130 135 140 Phe Leu Leu Glu Thr Lys Ile Leu Glu ArgAla Pro Phe Trp Val Pro 145 150 155 160 Thr Cys Leu Pro Pro Tyr Leu MetSer Ser Leu Pro Pro Glu Arg Pro 165 170 175 Tyr Asp Trp Pro Leu Ala ProAsn Pro Trp Val Tyr Ser Gly Ser Gln 180 185 190 Pro Lys Val Pro Ser AlaPhe Gly Leu Gly Ser Lys Gly Phe Tyr His 195 200 205 Lys Asp Pro Asn IleLeu Arg Pro Ala Lys Glu Pro Leu Ala Glu Ser 210 215 220 Gly Met Leu GlyLeu Ala Pro Gly Gly His Leu Gln Gln Ala Cys Glu 225 230 235 240 Ser GluGly Pro Ser Leu His Gln Arg Asp Gly Glu Thr Gly Ala Gly 245 250 255 ArgGln Gln Asn Leu Cys Pro Val Phe Leu Gly Tyr Pro Asp Thr Val 260 265 270Pro Arg Ala Pro Trp Pro Ser Cys Pro Pro Gly Leu Val His Ser Leu 275 280285 Gly Asn Ile Trp Ala Gly Pro Gly Ser Asn Ser Leu Gly Tyr Gln Leu 290295 300 Gly Pro Pro Ala Thr Pro Arg Cys Pro Ser Pro Gly Pro Pro Thr Pro305 310 315 320 Pro Gly Gly Cys Cys Ser Ser His Leu Pro Ala Arg Glu GlyAsp Leu 325 330 335 Gly Pro Cys Arg Lys Cys Gln Asp Ser Pro Glu Gly GlySer Ser Gly 340 345 350 Pro Gly Glu Ser Ser Glu Glu Arg Asn Lys Ala AspSer Arg Ala Cys 355 360 365 Pro Pro Ser His His Thr Lys Leu Lys Lys ThrTrp Leu Thr Arg His 370 375 380 Ser Glu Gln Phe Glu Cys Pro Gly Gly CysSer Gly Lys Glu Glu Ser 385 390 395 400 Ser Ala Thr Gly Leu Arg Ala LeuLys Arg Ala Gly Ser Pro Glu Val 405 410 415 Gln Gly Ala Ser Arg Gly ProAla Pro Lys Arg Pro Ser His Pro Phe 420 425 430 Pro Gly Thr Gly Arg GlnGly Ala Arg Ala Trp Gln Glu Thr Pro Glu 435 440 445 Thr Ile Ile Gly SerLys Ala Glu Ala Glu Gln Gln Glu Glu Gln Arg 450 455 460 Gly Pro Arg AspGly Arg Ile Arg Leu Gln Glu Ser Arg Leu Val Asp 465 470 475 480 Thr SerCys Gln His His Leu Ala Gly Val Thr Gln Cys Gln Ser Cys 485 490 495 ValGln Ala Ala Gly Glu Val Gly Val Leu Thr Gly His Ser Gln Lys 500 505 510Ser Arg Arg Ser Pro Leu Glu Glu Lys Gln Leu Glu Glu Glu Asp Ser 515 520525 Ser Ala Thr Ser Glu Glu Gly Gly Gly Gly Pro Gly Pro Glu Ala Ser 530535 540 Leu Asn Lys Gly Leu Ala Lys His Leu Leu Ser Gly Leu Gly Asp Arg545 550 555 560 Leu Cys Arg Leu Leu Arg Lys Glu Arg Glu Ala Leu Ala TrpAla Gln 565 570 575 Arg Glu Gly Gln Gly Pro Ala Met Thr Glu Asp Ser ProGly Ile Pro 580 585 590 His Cys Cys Ser Arg Cys His His Gly Leu Phe AsnThr His Trp Arg 595 600 605 Cys Ser His Cys Ser His Arg Leu Cys Val AlaCys Gly Arg Ile Ala 610 615 620 Gly Ala Gly Lys Asn Arg Glu Lys Thr GlySer Gln Glu Gln His Thr 625 630 635 640 Asp Asp Cys Ala Gln Glu Ala GlyHis Ala Ala Cys Ser Leu Ile Leu 645 650 655 Thr Gln Phe Val Ser Ser GlnAla Leu Ala Glu Leu Ser Thr Val Met 660 665 670 His Gln Val Trp Ala LysPhe Asp Ile Arg Gly His Cys Phe Cys Gln 675 680 685 Val Asp Ala Arg ValTrp Ala Pro Gly Asp Gly Gly Gln Gln Lys Glu 690 695 700 Pro Thr Glu LysThr Pro Pro Thr Pro Gln Pro Ser Cys Asn Gly Asp 705 710 715 720 Ser AsnArg Thr Lys Asp Ile Lys Glu Glu Thr Pro Asp Ser Thr Glu 725 730 735 SerPro Ala Glu Asp Gly Ala Gly Arg Ser Pro Leu Pro Cys Pro Ser 740 745 750Leu Cys Glu Leu Leu Ala Ser Thr Ala Val Lys Leu Cys Leu Gly His 755 760765 Asp Arg Ile His Met Ala Phe Ala Pro Val Thr Pro Ala Leu Pro Ser 770775 780 Asp Asp Arg Ile Thr Asn Ile Leu Asp Ser Ile Ile Ala Gln Val Val785 790 795 800 Glu Arg Lys Ile Gln Glu Lys Ala Leu Gly Pro Gly Leu ArgAla Gly 805 810 815 Ser Gly Leu Arg Lys Gly Leu Ser Leu Pro Leu Ser ProVal Arg Thr 820 825 830 Arg Leu Ser Pro Pro Gly Ala Leu Leu Trp Leu GlnGlu Pro Arg Pro 835 840 845 Lys His Gly Phe His Leu Phe Gln Glu His TrpArg Gln Gly Gln Pro 850 855 860 Val Leu Val Ser Gly Ile Gln Lys Thr LeuArg Leu Ser Leu Trp Gly 865 870 875 880 Met Glu Ala Leu Gly Thr Leu GlyGly Gln Val Gln Thr Leu Thr Ala 885 890 895 Leu Gly Pro Pro Gln Pro ThrAsn Leu Asp Ser Thr Ala Phe Trp Glu 900 905 910 Gly Phe Ser His Pro GluThr Arg Pro Lys Leu Asp Glu Gly Ser Val 915 920 925 Leu Leu Leu His ArgThr Leu Gly Asp Lys Asp Ala Ser Arg Val Gln 930 935 940 Asn Leu Ala SerSer Leu Pro Leu Pro Glu Tyr Cys Ala His Gln Gly 945 950 955 960 Lys LeuAsn Leu Ala Ser Tyr Leu Pro Leu Gly Leu Thr Leu His Pro 965 970 975 LeuGlu Pro Gln Leu Trp Ala Ala Tyr Gly Val Asn Ser His Arg Gly 980 985 990His Leu Gly Thr Lys Asn Leu Cys Val Glu Val Ser Asp Leu Ile Ser 995 10001005 Ile Leu Val His Ala Glu Ala Gln Leu Pro Pro Trp Tyr Arg Ala Gln1010 1015 1020 Lys Asp Phe Leu Ser Gly Leu Asp Gly Glu Gly Leu Trp SerPro Gly 1025 1030 1035 1040 Ser Gln Thr Ser Thr Val Trp His Val Phe ArgAla Gln Asp Ala Gln 1045 1050 1055 Arg Ile Arg Arg Phe Leu Gln Met ValCys Pro Ala Gly Ala Gly Thr 1060 1065 1070 Leu Glu Pro Gly Ala Pro GlySer Cys Tyr Leu Asp Ala Gly Leu Arg 1075 1080 1085 Arg Arg Leu Arg GluGlu Trp Gly Val Ser Cys Trp Thr Leu Leu Gln 1090 1095 1100 Ala Pro GlyGlu Ala Val Leu Val Pro Ala Gly Ala Pro His Gln Val 1105 1110 1115 1120Gln Gly Leu Val Ser Thr Ile Ser Val Thr Gln His Phe Leu Ser Pro 11251130 1135 Glu Thr Ser Ala Leu Ser Ala Gln Leu Tyr His Gln Gly Ala SerLeu 1140 1145 1150 Pro Pro Asp His Arg Met Leu Tyr Ala Gln Met Asp ArgAla Val Phe 1155 1160 1165 Gln Ala Val Lys Ala Ala Val Gly Ala Leu GlnGlu Ala Lys 1170 1175 1180 7 111 DNA Rat 7 agtcaggcca agctgccactaatccgggcg gggagagggg gggcacccac gtcagagcgg 60 ggactgccgg gtggagggcatctgaggaca tcccctccca ctcacgcggc c 111 8 111 DNA Mouse 8 agtcaggccaatctgccact aatccgggcg gggagagggg gggcacccac ctcagagcgg 60 ggactgccgggtggagggca tctgaggaca tcccctccca ctcacgcagc c 111 9 23 DNA Rat 9ggtggagggc atctgaggac atc 23 10 21 DNA Rat 10 tggagggcat ctgaggacat c 21

I claim:
 1. An expression construct comprising at least one humannucleotide sequence operably linked to a transcriptional regulatoryregion such that at least one human nucleotide sequence or itscomplement is transcribed; wherein said human nucleotide sequence isselected from the group consisting of: (a) a nucleotide sequence of atleast 50 nucleotides in length, wherein said nucleotide sequence isgreater than 98% identical to corresponding nucleotides in SEQ ID NO:1,and (b) a nucleotide sequence encoding at least 75 contiguous aminoacids of SEQ ID NO:2, wherein said human nucleotide sequence encodes apolypeptide with hairless transcription factor activity.
 2. Theexpression construct of claim 1, wherein the at least one humannucleotide sequence is at least 250 contiguous nucleotides of SEQ IDNO:1.
 3. The expression construct of claim 2, wherein the at least onehuman nucleotide sequence is at least 500 contiguous nucleotides of SEQID NO:1.
 4. The expression construct of claim 1, wherein the at leastone human nucleotide sequence is SEQ ID NO:1.
 5. The expressionconstruct of claim 1, wherein the at least one human nucleotide sequenceencodes at least 100 contiguous amino acids of SEQ ID NO:2.
 6. Theexpression construct of claim 5, wherein the at least one humannucleotide sequence encodes at least 250 contiguous amino acids of SEQID NO:2.
 7. The expression construct of claim 1, wherein the at leastone human nucleotide sequence encodes SEQ ID NO:2.
 8. The expressionconstruct of claim 1 further comprising an origin of replication and aselectable marker.
 9. The expression construct of claim 1 which is adouble-stranded expression construct further comprising the at least onehuman nucleotide sequence's complement.
 10. A transfected cellcomprising the expression construct of claim
 1. 11. A transfected cellcomprising the expression construct of claim
 8. 12. A transfected cellcomprising the double-stranded expression construct of claim
 9. 13. Amethod of making the transfected cell of claim 10, which is comprised ofan expression construct, comprising: (a) providing a host cell and (b)transfecting the host cell with the expression construct.
 14. The methodof making of claim 13, wherein the host cell is a human cell.
 15. Themethod of making of claim 13, wherein the host cell is without anendogenous Hairless gene.
 16. An in vitro expression system comprising:(a) the expression construct of claim 1 which produces a polypeptidewith Hairless transcription factor activity; (b) a reporter constructcomprising a transcription regulatory region responsive to Hairlesstranscription factor operably linked to a reporter gene; and (c) RNApolymerase, transcription factors, and co-factors which allows thepolypeptide with Hairless transcription activity to regulatetranscription of the reporter gene which is mediated by thetranscription regulatory region responsive to Hairless transcriptionfactor.
 17. A method of screening for a chemical agent which modulatesHairless transcriptional activity comprising: (a) incubating at leastone candidate chemical agent with the transfected cell of claim 10, (b)measuring Hairless transcriptional activity in the transfected cell, and(c) identifying the chemical agent which modulates Hairlesstranscriptional activity in the transfected cell.
 18. The method ofscreening for a chemical agent of claim 17, wherein Hairlesstranscriptional activity in the transfected cell is increased by theidentified chemical agent.
 19. The method of screening for a chemicalagent of claim 17, wherein Hairless transcriptional activity in thetransfected cell is decreased by the identified chemical agent.